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Statistical Properties of a Sum of Sinusoids 
and Gaussian Noise and its Generalization 
to Higher Dimensions 


By JOEL GOLDMAN 
(Manuscript received September 6, 1973) 


This paper investigates the statistical properties of the sum, S, of an 
n-dimensional Gaussian random vector, N, plus the sum of M vectors, 
Xi, --:, Xa, having random amplitudes and independent arbitrary 
orientations in n-dimensional space. We derive expressions for the proba- 
bility density function (p.d.f.) and distribution function (d.f.) of S and of 
its length, |S|, as series expansions involving only the moments of 
[X.|,7 = 1, ---, M. In addition, we find the p.d.f. and df. of the pro- 
jection of S onto 1-dimensional space. Our results are generalizations of 
the n = 2-dimensional problem of finding the statistical properties of a 
sum of constant-amplitude sinusoids having independent uniformly 
distributed phase angles plus Gaussian noise. The latter problem has been 
treated by Rice! and Esposito and Wilson,” but our results can also deal 
with sinusoids having random amplitudes. When n = 8, our findings 
treat, in the presence of a Gaussian vector, the classical problem of ‘random 
flights” dating back to Rayleigh. Some calculations for the 2- and 3-di- 
mensional problem are presented, and an application to coherent phase- 
shift-keying communications systems 1s discussed. 


I. INTRODUCTION 
In a number of problems arising in communications systems, in 
multipath phenomena, and in other areas, the determination of the 
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statistical properties of a sum of sinusoids and Gaussian noise is 
important for evaluating system performance. For this reason there 
has been interest in this problem for a number of years. Rice! first 
investigated the statistical properties of the sum of a single constant- 
amplitude sinusoid and Gaussian noise. Later, Esposito and Wilson? 
considered this same problem but with two constant-amplitude sinu- 
soids having independent uniformly distributed phase angles. More 
recently, Rice* studied the properties of a sum of M sinusoids and 
Gaussian noise. In this paper, we look at the natural generalization 
of this problem to n-dimensional space; namely, we determine the 
statistical properties of the sum of an n-dimensional Gaussian random 
vector plus the sum of M vectors having random amplitudes and in- 
dependent arbitrary orientations in n-dimensional space. In the special 
case when n = 2, our results are applicable to the type of problems 
considered by Rice and Esposito and Wilson, but they can also deal 
with any number of sinusoids with random amplitudes. When n = 3, 
our findings treat, in the presence of a Gaussian vector, the classical 
problem of “random flights’ dating back to Rayleigh. 

In Section II we give a definition of spherically symmetric random 
n-vectors and state a theorem which characterizes the form of such 
vectors in an n-dimensional spherical coordinate system. We consider 
M independent spherically symmetric vectors, Xi, ---, Xa, and define 
S = >, X,. Using our characterization theorem, we show that the 
even moments, Z[|S|?*], k = 1, 2, ---, can be easily expressed in 
terms of only the moments of |X:|, 7 = 1, ---, M. Then with the 
normal vector N ~ 9 (0, o?I) independent of the X,’s, we derive in 
Section III the probability density functions (p.d.f.’s) and distribution 
functions (d.f.’s) of S + N and of |S + N| as series expansions in- 
volving the moments of |S|. In addition, we derive the p.d.f. and d.f. 
of the projection of S + N onto 1-dimensional space in terms of a 
similar series expansion. When n = 2 and M = 2, we check that our 
results agree with those of Esposito and Wilson for two constant- 
amplitude sinusoids. 

Last, in Section IV we present some calculations for the 2- and 
3-dimensional problems, and discuss some aspects of the computational 
procedure that we use. Certain of these calculations provide results 
for the probability of error of a binary coherent phase-shift-keying 
communications system operating in the presence of M co-channel 
interferers and Gaussian noise. These results extend previously pub- 
lished computations.‘ Additionally, our findings can be used to find 
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the probability of error of this system operating in the presence of 
more general types of interference. 


Il. SPHERICAL SYMMETRY 


The generalization of sinusoids with uniformly distributed phase 
angles are ‘spherically symmetric’’ vectors defined in the following 
way (see Refs. 6 and 7): 


Definition: A random n-vector X = (Xj, +--+, Xn), n 2 I, is spheri- 
cally symmetric with matrix 9 if and only if the covariance matrix of X 
exists,* H(X) = 0, and the joint characteristic function of X is of the 
form :t 


Px(u) = ELe**'] = h[(ugu’)*] (1) 


for some function h on [0, ~) and where 9 is some n X n» (symmetric) 
positive definite matrix. Actually, h and 9 are defined only up to posi- 
tive multiplicative factors. However, in this paper we are only con- 
cerned with spherically symmetric vectors with 9 = I = identity 
matrix. Then h is uniquely determined and @x(u) = A(|u]). We denote 
such a spherically symmetric vector by the notation ‘X is s.s.”’ 

Note that if X: and X, are two independent s.s. vectors, then clearly 
X, + X, is also s.s. 

The probability density function of an s.s. vector X can be found by 
Bochner’s theorem.’ If h(|u|) is absolutely integrable, then the p.d.f. 
of X is: 


px(x) = gn(|x]), (2) 


where 


1 


gn(r) = Gay po am | hA)A*PET (n-a2 Arid r>O0, nZl. 


Thus, if X is s.s., its p.d.f. is constant over every n-dimensional sphere 
centered about the origin. This vector is precisely what is meant by a 
“random flight” in a higher dimensional space. 

For our purposes, a more suitable characterization of an s.s. vector 
is given by the following theorem proved in Ref. 9. 


“ Expected value will be denoted by £(-). 
We denote vectors by boldface characters: u = (u1, «++, Un). The character wu’ 
is the transpose of u. The norm of u is denoted |u| = (uu’)}. 
For n = 1, a spherically symmetric random variable has an even characteristic 
function, By (u) = h[{p} |u|]. 
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Theorem 1: Suppose X = (Xi, ---, Xn), n 2 2, ts s.s. Then there exists 
a unique set of random variables R = 0, ®. € [0,7], k = 1,---,n —2, 
6 € [0, 2 ] for which* 


jo 
X; = R(T] sin) cos @; lsjsn-2 
k=l 
n—-2 . 
Xvi=Rf ( II sin 1) cos 6 (3) 
k=1 
n—2 : . 
Xn = R( II sin &:) sin 6, 
k=1 
and furthermore (R, ®1, «++, ®n—2, 0) are independent and have respec- 
tive p.d.f.’s: 


Dr) = 2nr!? ir ( 5) | ec) r=0 


= — —l 
po, (or) = T ( noert ) am} ir (7 5 : )| sin”!-*g, 
OS@S7 (4) 





kK=1,--:,n—-—2 
1 
= < 
po (@) ae 050< 27 


for the gn(-) of (2). 

Conversely, if (R, ®1, +++, Pn—2, 6) are independent and have the 
p.d.f.’s given by (4), and X ts defined as in (8), then X is s.s. 

The utility of this theorem lies in the fact that the random variables 
(R, 1, ---, @n-2, 6) are independent with specified p.d.f.’s. As an 
immediate corollary, we see from (2), (8), and (4) that: 


Corollary 1: Suppose X = (Xi, -°-, Xn), n 2 1, ts 8.8. Then its p.d f. 
ts given by: 
pa(x) = (2) (n/2) PED. 


({x|)77 
Moreover, forj = 1, ---, n and for all z, 
P(n/2)BU|X|*] _ TG@)ECI As] 
PL(n/2) + 7] Ty + 2) 


Using Theorem 1 we can prove: 


Theorem 2; Suppose X1, ---, Xu are independent s.s. n-vectors, n = 1. 
Let S; = S421 X:, 7 = 1, ---, M, and define 


* We define []z-, a, = 1. 
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uP = ET|S;|2*], k= 0, I tes, j =— 1, rey 
y 2m) EL |X,|?"], m= 0, 1, ney £ = 1, oe, 


——— 


Put 
A BL(Qm + 1)/2, (n — 1)/2] 
eS Beer 


where B(-,-) is the beta function. Denote 





(2m) — (0) (2) F (2m) 
Uj ie (uy » Up, tty By ) 
(2m) (0) (2) LL, (2m) 
Vj i (vf » YT; » Vj ) 


and define D,,; to be an (m+ 1) X (m+ 1) matrix whose (k, )th 
element equals 


2 — 2\ Cn,2k-2€n 2t-2k (4x) . 
' ' (2 > hk 
Gi 7 ) am vj > of 42k 


and is 0 af € < k. 
Then, for 7 = 2, ---, WM, andm =0,1,---, 


m [ 2m \ €n,2i Cn.am—2i ; 
2 == n,2tUn,2m—21 2 2n—2 
ape 5 (OP ) Ce ait sg pan, (5) 
i=0 U Cn,2m 
In matrix form this is 
(2m) (2m) 
Uj = w-7D,,,;, 
so that 
ue” = vP"™D no: Deis (6) 


for 7 = 2, ---, M. 
Proof: By Theorem 1 we have for each X; a corresponding vector in 
spherical coordinate space: 
X; > (R:, P15, aan Pn_-9, i, 0;). 
Since §; is a sum of independent s.s. vectors, it is also s.s., so there are 
vectors corresponding to it: 
Sp (Pp big 85: bases, Ma 
Note that uf” = HLP?*] and »f” = H[R?”"] and that 
E[cos* b, ; | = E{cos* £1; ] 
T'(n/2) Ma Mae be a 
= cos? a sin” ada 
P(s)0Le(n — 1)] Jo 
Cn,2i if 7 is even 
0 if 7 is odd. (7) 
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Denote the components of S$; and X; as follows: 


Sp Sig ees Sag) 


SN gy ee Xn) 
so that 
S13 = ba cos £1; 
X1,; = R; cos ¥4,;. 
Also, 


E(Si;] = ELP} cos* £1,;] 
= ELP;JE[cos? &,;] = 0 (8) 
if 7 is odd, and 


ECSY3] = ELP;" JE[Lcos’™ £1,;] 
= pe Cn om (9) 
and 
EL X24] = ECR? ]E[cos?*@,,;] = vPenon. (10) 


Noting that S1,;-1 is independent of X,,; since S;_1 depends only on 
Xi, ::+, X;-1 which are independent of X;, the following equalities 
follow from (8) to (10): 


Br Cpa = ELS? 
E{(S1,5-1 + X15 2"} 


= > ¢ Be )z Hsia l Eee Pe} 
- > ( ) EUSt)-1JELX33-*] 
_ > ( = BS ees] 


Hence, 
“em = > ) Cn,24 Cn, 2m—2i pd y2m—20 


1=0 Cn,2m 
The vector equation 
uP = wAPDrig j= 2,-:-,M (11) 


follows immediately from this wee Since u?” = v?™, eq. (11) 
implies that uf?” = vP™Dz»2-++Dnj-1Dn, for j = 2,---,M. For 
n = 1, we prove (5) directly. 
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In Reference 9 we proved the result: 
Theorem 8: If X is s.s. and independent of N ~ 91(0, o°I), then the p.d.f. 
of X + Nis: 


nr 


1 = 1 
Pain(®) = gaan? (3) Pr ear 
1 v|Z 
xexp | — sr + [2/9] Tooe(“Zl)n a1, a9) 


where I,(-) ts the vth order modified Bessel function of the first kind. 
From this theorem we obtain the following: 


Corollary 2: Suppose X is s.s, independent of N ~ 2(0, oI), and 
De £C/20e)*/t!JEL|X|2*] < © (for example, if |X| is a bounded 
random variable). Let Z be the projection of X + N onto 1-dimensional 
space, 1.e., Z is (say) the first component of the vector X +N. Then 
for n = 1 the p.d.f.’s of X + N, of |X +N], and of Z are given, respec- 
tively, by: 


Pxin(Z) = ehe exp (— a 21") 
LE O21, | 22/20!) (— 1/26°)BE |X |] 




















. x TL (n/2) +7] (13) 
Pixewi(0) = Gaya OT eP (- a ) 
pee ates, 144 
and 
pele) — BS ex (- 5) 
, pee a See TEE ies 


where Li(-) are the generalized Laguerre polynomials. In addition, 
for n = 1 the “distribution functions” of |X + N| and of Z are given, 
respectively, by: 


1 n @ a? \nl2 a 
Pr{IX+N| >} = nal (595) — (aa) exp (— 93) 


ye Se Lie? (at/ 20%) (— 1/20*) BL |X |] 


i=l iL (n/2) + 7] 46) 
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and 


Pr (Z >a} = derfe ( a ) 4 Tol?) exp ( =) 








(202)! } = 2 (270?)3 Qa? 
2. Li, (a?/20°) (— 1/20?)' HE |X|") 
or CYL) 


where T'(-,-) ts the incomplete gamma function (Ref. 10, p. 337) and 
erfc (-) ts the complementary error function. 


Proof: From Ref. 10, p. 242, we have the generating function 


Aig lle Se hee) aye 
(Gaye e@i) = 2 Tere D 


With t = v/V20?, x = |z|/V20?, anda = (n — 2)/2, we substitute (18) 
into (12) to get: 


1 n 1 2 
Dytn(Z) = soot (5) exp | - a3 21" / dup |x) (v) 


1 ES LE @~2)/21( | 7 |2/293) (— 1/20?) ty? 
(20?) "9? io TL (n/2) + 7] 


T 2 1 
_ oe — (- x 2") 
= Lfe-¥/1(|2|2/26%) (— 1/20%)*ELX |] 
xd P(n/2) + 7] , 


é 





(18) 


x 


(19) 


assuming that the interchange of integration and expectation is valid. 
The second assertion of the corollary follows from Corollary 1 since 


Qrrl2 


Pixtni({Z|) = P(n/2) [Z| *—"'ps4n(Z). (20) 


To prove (15) we note that Z = X,+ N, is a 1-dimensional s.s. 
random vector and we apply eq. (13) (with n = 1) and the second part 
of Corollary 1 to obtain the desired result. 

Next, to show (16) we integrate (14) over the interval (a, ») and 
utilize the relationships : 


y2 1 n a 


ie a 
a v™— exp (- a) dv = 5 (207) "7 G m) (21) 
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and forz = 1, 


Pte v Cin—2/2] ( _&* 
f v™—! exp 5g L; 5g dv 
a y= (a?/2a?) 


1 
= —_— = n — 9 
5 @" exp ( 52 ; (22) 


Equation (21) is given in Ref. 10, p. 337, and (22) is proved in the 
appendix. 

Finally, to obtain (17) we integrate (15) and use eq. (22) withn = 1 
and the definition: 


erfe (x) = = f exp (—#*)dd. 


It remains to justify the various interchanges of integration and 
expectation or summation. For example, to validate the interchange 
In (19) it suffices (Ref. 11, pp. 28-29) to show that 


= (1/20*)* 
‘=o TL (n/2) + 7] 


Since (Ref. 12, p. 207) |L@(y)| Se"Tati+ l/l + I), 
the expression in (28) is less than or equal to: 


2 (1/20%! : jz|?\ 1 PL (n/2) +7] 
& oT (n/2) +7] E(|X| J exp ( 4g? ie I (n/2) 


2 
EC |X|**] igen (BP) <0. (28) 











which is finite by hypothesis. 

The utility of this corollary lies in the fact that we can evaluate the 
various p.d.f.’s and d.f.’s knowing only the moments of |X| and not the 
entire distribution of X. 


Ill. STATISTICAL PROPERTIES OF THE SUM OF INDEPENDENT 
SPHERICALLY SYMMETRIC VECTORS AND GAUSSIAN NOISE 
For simplicity, we combine the results of Corollary 2 and Theorem 
2 into: 


Theorem 4: Suppose Xi, ---, Xu are independent s.s. n-vectors,n = 1, with 
moments vP™ = EL |X,|2"], m = 0, 1, ---, and ¢ =1, ---, M, which 
are also independent of N ~ 9(0, o°I). Let S = DUiL, X; and assume that 
Diixo (1/20*)*/EL|S|2*]i! < © (for example, if the |X;|’s are bounded 
random variables). Let Z be the projection of S + N onto 1-dimensional 
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space. Then the following relations for p.d.f.’s and d.f.’s are valid: 


Dsin(Z) = Becia exp ( — si z|*) 


se LEe-2/1(| 22/208) (— 1/20*) ni 




















{=0 PL(n/2) + 7] » (4) 
2 1 
P\s+tnN} (v) = (2e2)*? grt exp (- 5g? ) 
 LE'*~9/9(92/292) (— 1/20") ul? 
on as Ce ae 
_ T(n/2) ee 
p2(2) = (27a?) ae ( i) 
co L{-» (22/20?) (— 1/207) u@ 
SENT oy OM 7 ST WORN 26 
se 2X TL (n/2) + 2] = 2D) 
1 n @ a \ne 
Pr {|S + N| >a} = ray ($32) — (5) 
at, Li? (at/2a%)(— 1/20?) uh 
xew(— 35) 5a 
and 
_l a _ aT(n/2) 
Pr{Z >a} = x erte ( orn | 2 Qo)! 
_ @Y & LPs (@?/20?) (— 1/20?) uf? 
pe ( a) 2 aaa Oe 
The moments u% 2 EL|S|2*] are determined by the recurrence relations 
ie = = ( . ) ae mnt 29) 


for j = 2,---, M with 


a, = BL@m +1)/2, (wm = 1/2] 
oe BEB, (n — 1)/2] 


or by the matrix equation (6). 
We next look at some special cases. 


A. n = 2-dimensional space 


When n = 2, eqs. (24) through (29) reduce in an obvious manner. 
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The incomplete gamma function in (27) equals (Ref. 10, p. 339): 


a? a? 
t (195) -e(- 5). (30) 


The Laguerre polynomials L{-» and L in (26) and (28) can be ex- 
pressed in terms of Hermite polynomials H;(-) using the relations 
(Ref. 10, p. 240): 

*y-3 5:44 (23) 


1 —1 
Ly? (e) = wen 





(31a) 
and 
(—1)*H2; (2?) 


L{-? (2) = Toe (31b) 


For example, eq. (28) can be rewritten as: 


2 
Pr {Z>a} = gente (—2-) + Zen (- 35) 
oO 


© H»:-1(a/V20?) (1/20%) pe 
ss p> (7 1)2024 - (32) 


& Cn 2k Cn, 22k 7\? 
2k Cnoe NE 


ti /7\2 
21) _ 2k), (2i—-2k 
we = (1) ween 





We also check that 


so that 


I 
M-. 


2 (2k), (2i—2k 
) Vj yf ie 


(2%) : é a\P(t\ (2k) (2f—2k). (2i—2¢ 
) = = 
W305 = pz) a (1) ba) Vy yf yet , 


and so forth. 

Consider the type of problem investigated by Rice!? and Esposito 
and Wilson? in determining the p.d.f.’s of the envelope and instan- 
taneous value of 


M 
z(t) = om Az cos (wyt + 4.) + n(d), 


where each A, = 0 is independent of 6, and 4; is uniformly distributed 
on [0, 27). Assume that the pairs {(Ax, 4,)} are independent of each 
other and of n(-). Suppose n(¢) is the result of the passage of zero- 
mean white stationary Gaussian noise through a bandpass symmetrical 
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filter. Then n(t) can be written as (Ref. 13, pp. 142-148): 
n(t) = ni(t) cos wot — ne(t) sin wel, 


where n;(¢) and n2(t) are zero-mean independent stationary low-pass 
Gaussian processes with 


o = E[n(t) P = EL)? = Elne(t)f. 
Let 6.(t) = (we — w.)t + 6, and thus: 


2(t) 


= Apeostnd EO Ea) 


I 


M 
> Ax cos Lwot + 64(t)] + ni(t) cos wot — ne(t) sin wet 
k=1 


bs A, cos 0x(t) + mid | COS Wet 
=1 


M 
= | > Ax sin 6;.(t) + nal’ | SiN Wol 
k=1 
= A(t) cos [wot + y(t) ]. 
At any time ¢, let 0, = 6x(to), m1 = mi(t.), and ne = no(t.). Put 


X, = (Ax cos 6;, Ax sin 6x), k=1,---,M, 
and 
N = (n1, Ne). 


Then 5-41, X, + N iss.s., so by Theorem 1 it has the form (I cos ¥, 
I sin VY), where T = 0 is independent of WY and W is uniformly dis- 
tributed on [0, 27). It follows that 


z(t.) = I cos ¥ cos wot, — I'sin V sin wt, 
T' cos (wot + WV); 


that is, [ = A(t.) and YW = y(t,.). Hence, at any time ¢., A(t.) and 
y(t.) are independent and y(t.) is uniformly distributed on [0, 27). 
Moreover, the p.d.f. of the “envelope” A(t.) is the p.d.f. of 
r= |>°>#_,X,+N| which can be determined from (25). Thus we 
can find the p.d.f. of the envelope of the sum of Gaussian noise plus 
any number of sinusoids with random amplitudes and independent 
uniformly distributed phase angles. The case considered by Esposito 
and Wilson? was that of M = 2, Ai = a= constant and A, = b 
=constant, in which case 


ye = EL|Xsl?"] = a, 
yf = BL|X2|?] = be, 
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and 
ps? = De ) Q2kp2i-2k. 
k=0 


The envelope p.d.f. is then: 


priv) = =; exp —0*/20?] 
aw Ls(v?/267)(— 1/207)? - , fess 
t 2kp2i—2k 
oe a! | 2 (z) oe | 


which agrees with the result in Esposito and Wilson [ Ref. 2, eq. (12) ]. 
This expression was obtained earlier by Goldman." 
To find the p.d.f. of z(t) at some time instant ¢,, note that: 


z(t.) = A(t.) cos [wote + ¥ (to) ] 
T' cos (Woto + W). 


Since WV is uniformly distributed on [0, 27) and the cosine function 
has period 27, the p.d.f. of T cos (wot. + WV) is the same as that of 
T' cos VW. Recall that ([ cos ¥, I sin ¥W) = )°74, X, + N. Thus, [ cos v 
is the first component of the 2-dimensional vector }°#£, X, + N and, 
from eq. (26), its p.d.f. is: | 


Pz(to) (#1) = Pr cos y (21) 
ee L a) sh (= 1/20%) ust? 
~ Qno2yi OP (= 208 #) % il 
2 
x LY? (ss ): (33) 
In Esposito and Wilson’s example, this becomes 


4 1 4) & (= 1/20) 
D2 to) (21) — (Qr0”)3 exp ( oe? 4) > il 


«x Lf» ie 3 7\ 2kpei—2b- 
: 20? kao \k 7 : 


which agrees with their eq. (29). 
We also check that the d.f. in (27) is the same as that obtained in 
eq. (18) of Ref. 2, when we use the fact that (Ref. 10, p. 241): 


tL, (2) = TL 21(2) — LP (x) ]. 


Finally, consider a binary coherent phase-shift-keying communica- 
tions system operating in the presence of Gaussian noise and M 
co-channel interferers modeled by a sum of constant amplitude sinu- 
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soids with independent, uniformly distributed phase angles, 61, ---, @.1. 
(Details of this model and system may be found in Refs. 4 and 5.) 
The probability of error in such a system is: 4 


M 
Pe = Pr} > b;cos6#; + Ni >a}, 
i=1 


where N; ~ 9(0, o?) and a is the amplitude of the transmitted (de- 
sired) signal. This probability of error is given by the expression in 
(82) and agrees with the result found in Refs. 4 and 5. However, eq. 
(32) can also be used to find the probability of error in this system for 
a more general class of co-channel interference consisting of a sum of 
uniformly phased sinusoids having also independent random ampli- 
tudes. 


B. n = 8-dimensional space 


When n = 38, eqs. (24) through (29) reduce in a straightforward way. 
Equations (24) to (26) and (28) can also be expressed in terms of 
Hermite polynomials by employing eq. (31). The incomplete gamma 
function in (27) can be written in terms of tabulated functions by use 
of the relations (Ref. 10, pp. 389-340) : 


T(e + 1, x) = cl (c, x) + xe” 
and 
P'(2, x) = mw erf (z+), 


where erf(-) is the error function. 
The recurrence relation for the moments becomes, for M = 2, 


up? = So (2) ( 2h AL) ( 2 = 2 + 2)(2 +1) oy, rw 
imo \ 2k J\ 2k £1 )\ 2 — 2k 1) +2 


and so on for higher values of M. 








IV. SOME COMPUTATIONS 


The form of the expressions in (24) through (28) is quite similar, 
and so the computer programs used for their evaluation were only 
slight modifications of one basic (Fortran IV) program. Different 
values of n could also be treated easily. The basic program required 
computation of a sum of the form: 


2 LY (x) (— 1/207) us? 
i=0 I'L (n/2) + 1] 


where x is a variable. 


(34) 
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Fig. 1—Plots of p.d.f. of |S + N] for o? = 
vector lengths (6, bs, b3). 


1, M = 3, n = 2, and different sets of 


In one part of our program, the moments p{7 were determined from 
eq. (29): 


ny? = = Ca(z, k) pvp, 1 a ae M, (35) 
where 
: A 22 \ Cn,2% Cn 2i—2k 
Ch (2, k) a, ee) Cn24 (36) 


Using the definition of c,,.2. and properties of the beta function, we can 
show that the coefficients C,.(1, k) are equal to: 


Cn(t, k) 
7 TG + 1) (n/2)TL(n/2) + 7] (37) 
Me+ DG —k+ 10 (n/2) + kL (n/2) +7 -— kb] 


To efficiently compute these coefficients and to eliminate “overflow” 
problems, we utilized the simple recurrence relation 


CG es 1)[(n/2) +i — k] 


Late es Pe Se) 
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Fig. 2—Plots of p.d.f. of |S + N| for o? = 1, M = 4,n = 2, and different sets of 
vector lengths (61, be, bs, b,). 
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Fig. 3—Plots of p.d.f. of |S + N]| for o? =1, M = 6, n = 2, and vector lengths 
all equal to b. 
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Fig. 4—Plots of p.d.f. of |S + N| for o? = 1, M = 3,n = 3, and different sets of 
vector lengths (1, be, bs). 
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Fig. 5—Plots of p.d.f. of |S + N] for o? 
vector lengths (61, 62, b3, 54). 
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Fig. 6—Plots of p.d.f. of |S + N| for o? 
all equal to 6. 
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Fig. 7—Plots of Pr {Z > a} forn = 2, 10 logio (a2/>-™, b?) = 6 dB, bi =--- = by, 
and for various values of M. 
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Fig. 8—Plots of Pr {Z > a} forn = 2, 10 logio (a?/35*, b?) = 8dB, b} =--- = by, 
and for various values of M. 


together with C’,(z, 0) = 1. To evaluate uf from eq. (35), particular 
sets of moments {vf} could be read into the program. However, for 
simplicity we chose spherically symmetric vectors having constant 
lengths bi, ---, Dar. 

The second part of the program was concerned with computation of 


nvin(-)/a(54) 


In order to avoid ‘‘overflow”’ difficulties, we actually computed 


' ae ' 1 rs 2 
L$ (a) /T € + i) with A='—s35 ( De bs) 
9 20° \ 1 
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Fig. 9—Plots of Pr {Z > a} forn = 2, 10 logio (a?/ 20%, b3) = 10 dB, b; =--- = bm, 
and for various values of 


To do this efficiently we used the iterative identity : 
L{® (x)né _ (Q+a-—-1- x) y L@, (2) 
Pm) +a) a @/yeay NO 
SMe) 
TL (n/2) + 7] 
[which follows from the Laguerre polynomial recurrence relation 
(Ref. 10, p. 241)], together with the fact that L§(x) = 1 and 
Li (*4) =a+1—~-z. 
The final part of the program computed the sum: 
< L{ (x)n* (2i) 
ee NS 40 
Xe Taja) Fay se 
where a9 = pf/ (0, b,)?*. (The factor 1/ (3074, 6,)?* was built into 


MNLM(xz), i222 (39) 
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the computation of eq. (35) in order to find af. ] A convergence check 
was provided to end the summation after additional terms did not 
change any significant digits. Although the program was written to 
handle up to 200 terms in the sum, many computations required less 
than 50 terms. As Esposito and Wilson? also noted, for certain values 
of x, o, and {b;}, the terms in (40) alternate in sign and have magni- 
tudes of the order 10". For these cases, precision and convergence could 
not be guaranteed. The typical CPU time required to compute eq. (40) 
for 200 values of x was about 10 to 20 seconds in double precision 
arithmetic on the IBM 370/165 system. 

Some representative results of these computations are shown in 
Figs. 1 to 12. Figures 1 to 6 are plots of pjsin)(v) as a function of v 
for o? = 1, for various values of n and M, and for s.s. vectors having 
constant lengths b;, ---, bar. Curves for n = M = 2 were given in 
Ref. 2. Figures 7 to 12 are plots of Pr {Z > a} versus the quantity 
10 logis (a?/20*) for fixed values of the quantity 10 logio (a2/>°74, 63) 
and for various values of n and M. In these curves, for simplicity, we 
took 6b; = bp = --- = bar. As we discussed in the last section, the plots 
in Figs. 7 to 12 represent the probability of error of a binary coherent 





10 LOG10( a2/ 202 ) IN dB 


Fig. 10—Plots of Pr {Z > a} forn = 3, 10 logio (a2/>-%4 6?) = 6 dB, b; =--- = bau, 
and for various values of M. 
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Fig. 11—Plots of Pr {Z > a} forn = 3, 10 logio (a2/ 0%, b?) = 8 dB, b1 =-+-- = bu, 
and for various values of M. 


phase-shift-keying system versus signal-to-noise ratio, 
SNR = 10 logio (a2/20*) (dB), 


for fixed values of signal-to-interference ratio, 


M 
Sih S10 lows (« Vi > vt) (dB). 
i=1 


These results extend those previously found in Refs. 4 and 5 to larger 
values of SNR and smaller values of SIR. 


Vv. CONCLUSION 


In this paper we presented expressions for the p.d.f. of a sum of 
spherically symmetric random vectors plus a Gaussian vector in n-di- 
mensional space. We also found expressions for the p.d.f. and d.f. of 
the length of this sum and of the projection of this sum onto 1-dimen- 
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Fig. 12—Plots of Pr {Z>a} for n=3, 10 logio (a2/D*%, 6?) =10 dB, bi =--+ =by, 
and for various values of M. 


sional space. All of these expressions were series expansions involving 
only the moments of the length of the sum of the s.s. vectors. These 
moments could be found from recurrence relations also derived in the 
paper. Some computations of the p.d.f.’s and d.f.’s were presented for 
the 2- and 3-dimensional cases, and an application to a communications 
system was discussed. However, as pointed out earlier in Refs. 2 and 3, 
there are sometimes difficulties in evaluating these p.d.f.’s and d.f.’s 
for certain parameter values, even for the case of s.s. vectors having 
constant lengths. 


APPENDIX 
To prove eq. (22) we use the fact (Ref. 10, p. 241) that, for 7 2 1, 


£ Leto Liat? (t)] = te tel (0). 
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Integrating this expression over the interval (a, ©) yields 


— age Let (a) = ; eta L (t) db. (41) 


a 


Equation (22) follows from (41) after a simple change of variables. 
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Experimental Investigation of a Linear 
500-Element 3-Phase Charge-Coupled 
Device 


By C. H. SEQUIN 
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A linear, 500-element, 3-phase charge-coupled device, originally built 
as a high-resolution linear image sensor, has been chosen as a representa- 
tive structure for single-level, 3-phase charge-coupled devices to exem- 
plify the performance of such devices in detazt. 

Charge-handling ability and transfer efficiency have been studied as a 
function of various design parameters and operating conditions. Most 
of the observed functional dependences are well understood and agree 
with the expectations based on model calculations. However, various 
problems are encountered in these structures. An unusually wide spread 
of the performance of different devices and slow instabilities are observed. 
They are attributed to a lack of control of the interface potential in the 
gaps between the transfer electrodes. 

Some emphasis is placed on a more detailed description of the various 
measurement techniques used. These techniques are of a general interest 
since they are applicable to other charge-transfer devices. 


1. INTRODUCTION 


The principle of charge coupling was first conceived using a 3-phase 
technique with simple, nondirectional transfer electrodes.! This ap- 
proach using all identical electrodes in a single level of metallization 
has been successfully demonstrated on several successive designs of 
linear devices; each structure has a greater number of elements with 
smaller electrode length than the previous design.?-* Correspondingly, 
improved efficiency at higher frequencies has been observed. 

The charge-coupled device (CCD) discussed in this paper was de- 
signed as a high-resolution linear image sensor with 500 three-phase 
elements at a spatial period of 18 um, capable of reading half a line of 
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a normal page of printed material. The device has also been demon- 
strated as an analog shift register able to delay a line of the Picture- 
phone® video signal.’ In these demonstrations, a few of the best 
devices have been used, and their performance has been described in 
the context of these particular demonstrations only. The following 
discussion enlarges the description of the performance of 3-phase 
single-level metal CCD’s by reporting more representative results ob- 
tained in a study of many devices. The goal of this study was to obtain 
an understanding of the functional dependences of charge-handling 
ability, transfer efficiency, and dark current on operating potentials, 
frequency, and temperature. In addition, devices with different 
electrode lengths and different fabrication technologies have been 
compared. However, similar devices on different slices show a wide 
performance spread and slow instabilities and show that some results 
are even irreproducible. All this is attributed to a lack of control of the 
interface potential in the gaps between the transfer electrodes. 

The techniques used to determine signal-handling ability, transfer 
efficiency, or dark current profiles are described in some detail, since 
they are applicable to the investigation of other charge-transfer 
devices. 


Il. THE 500-ELEMENT DEVICE 


The device discussed in this paper was designed as a linear image 
sensor with high-resolution density. At the same time, the device was 
supposed to serve as a test structure for the high-frequency perform- 
ance of 8-phase CCD’s. To obtain good transfer efficiency, a 
“smooth” interface potential profile with no barriers or pockets is 
required, which transfers the minority carriers under the influence of 
electric fringe fields from electrode to electrode. Extensive computer 
modeling studies® have shown that this condition can be achieved in 
a 5 X 10 cm- silicon substrate with 3000 A of SiO. as an insulator 
and electrodes shorter than 5 um. If, in addition, the gaps between the 
electrodes are made 3 wm or smaller, the device can be operated at 
pulse amplitudes of 15 V and no barriers will form underneath the 
transfer gaps for charge densities at the Si-SiO, interface ranging from 
about 5 X 10° to 2 X 10!! cm~?. Thus, the unit cell was made as small 
as possible using available technology. Individual electrodes, nominally 
3 wm long, were arranged at a spatial period of 6 wm, leading to a cell 
length of 18 um. The calculated transfer time constant for electrons 
lies in the subnanosecond range. Ideally, the transfer efficiency in the 
frequency range of this study should then be limited by the effects of 


582 THE BELL SYSTEM TECHNICAL JOURNAL, APRIL 1974 


ae 
i ene 





Fig. 1—One end of the linear 500-element 3-phase CCD. Shown are the three 
sets of transfer electrodes (E1, E2, and E3), the input diode (D), and the input gate 
(G). Electrodes E2 are alternately contacted to diffused crossunders (C) through 
contact windows (W) lying on either side of the transfer channel. 


interface states. This was indeed found true for the best devices, which 
showed no problems associated with the bare transfer gaps. 

A 3-phase CCD needs one crossing of electrodes per element, which 
in this device are realized with diffused crossunders. They are intro- 
duced by the same phosphorous diffusion (10!* cm~*) that forms the 
input output diodes. The crossunder bus line that addresses electrode 
system #2 (Fig. 1) is repeated on either side of the transfer channel, 
and the transfer electrodes H2 in subsequent clements are contacted 
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alternately to either one of them. In the resulting structure (Fig. 1), 
which has a periodicity corresponding to two CCD elements, con- 
siderably larger and thus more reliable contact windows can be formed 
than in a conventional approach with only one bus line. 

The channel width, defined by a channel-stopping boron diffusion 
(10'7 cm=*), was designed to be 15 ym, taking into account a lateral 
diffusion of 1.5 um. This width was chosen as a compromise between 
the preference for higher signal levels and the reduction in the vertical 
resolution that would be caused by too wide a channel if the device 
is used as a line scanner with no separate means to restrict the light- 
sensitive area. 

The transfer channel is terminated at both ends by input output 
diodes. They are electrostatically shielded from the pulsed transfer 
electrodes by a gate electrode, which in normal operation is kept at a 
de potential. The whole device is surrounded by large substrate con- 
tacts which serve as points of reference for the driving pulses. A 
special small substrate contact near the output diode can serve as a 
reference ground for the video signal. Including these features, the 
device dimensions are 230 um X 9150 pm. 

Devices were fabricated originally on 10-ohm-cm p-type silicon sub- 
strates, with 3000 A of SiO.. Devices with 1500 A of SiOz, or with a 
double insulator structure consisting of 1200 A of SiO, and 500 A of 
Al,O3, were also built for comparison. 

In most devices, the transfer electrodes were chemically etched out 
of 1500 A of RF-diode sputtered tungsten. Some batches, however, 
used Al or backsputter delineated’ Ti-Pd-Ni metallization. With two 
sets of masks, various exposure times of the photoresist, and various 
etching procedures, the electrode length could be varied in different 
batches from about 1.5 um to 4.5 wm. After metallization, the devices 
were subjected to annealing treatments to reduce the interface state 
density. Most commonly, the devices with refractory electrodes were 
heated in a hydrogen atmosphere at 700°C for 1 hour; the Al devices 
were annealed at 380°C. A considerable improvement in transfer 
efficiency was normally observed. 

In operation, many devices showed a strong sensitivity to changes of 
the ambient, which could be demonstrated by breathing onto the 
surface. To reduce these effects, some devices were protected with a 
second dielectric level, such as 1 um of a phosphorous glass or 1000 A 
silicon nitride. 

Most of the results presented in the following sections were obtained 
on devices with 3000 A of SiO., with electrodes 3.5 um long of RF- 
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diode sputtered tungsten, and with no protective dielectric layer on 
top of the electrodes. Deviations in behavior due to different tech- 
nologies will be discussed in a special section. 


I. EXPERIMENTAL PROCEDURE 


After screening tests on the uncut wafer, working devices were 
mounted on ceramic substrates for the investigation. A few selected 
devices have been demonstrated as line image sensors‘ or analog delay 
lines. The following sections present a detailed report of studies 
carried out on several different devices with reasonably good 
performance. 

The mounted devices were investigated in a test setup, illustrated in 
Fig. 2, built around an optical microscope. A set of TTL logic, three 
sets of pulse drivers with different rise times, and several preamplifiers 
with various bandwidths were used to investigate the devices in the 
frequency range from 1 kHz to 17 MHz. The devices could be operated 
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Fig. 2—Schematic layout of test setup. 
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as analog shift registers with the driving pulses applied continuously 
and charge packets injected periodically through the input diode. 
Alternatively, the devices could be held in the storage mode, inte- 
grating the dark current or carriers generated by light projected onto 
it. A sharp spot of light, about 2 um in diameter, projected through 
the microscope, was especially useful in these investigations. By moving 
the spot along the device, singularities in transfer behavior could be 
localized and often associated with visually observable defects. The 
built-in object illumination of the microscope was used in addition to 
provide a fairly uniform background. 

The accumulated charge could be read out at various clock rates 
after various integration times. The driving pulse shapes, pulse ampli- 
tudes, and bias potentials, and the mutual overlap between subsequent 
pulses could also be varied to study their effects on the performance 
of each device. 


IV. CHARGE HANDLING 


The basic task of any charge-transfer device is to carry charge along 
the transfer channel. An important characterization of a device is, 
therefore, the maximum amount of charge that can be handled at a 
given amplitude of the driving pulses. In the following section, the 
functional dependence of the charge handling ability on various 
operating parameters will be investigated. 

The maximum amount of charge that can be carried in the potential 
wells underneath the transfer electrodes for a given set of operating 
conditions 1s measured by observing the output signal as the device 
is driven into saturation. In the shift-register mode this can be achieved 
electrically by injecting more and more charge from the input diode. 
In the storage mode a light spot is used to fill a single well until it 
starts to spill into neighboring elements. In both cases, beyond satura- 
tion the output signal pulse starts to widen rapidly and its amplitude 
increases only slowly and often in an irregular manner. 

The amount of charge that a transfer pad can hold is approximately 
determined by the product of the oxide capacitance underneath the 
pad C,. and the applied voltage Vez + Vp. Fringe effects increase that 
capacitance somewhat. On the other hand, not all the applied voltage 
will appear across the oxide. The interface potential for a full bucket 
will still be larger than zero. In fact, it cannot be lower than the 
barriers produced by the isolating electrodes, which are kept at a 
voltage Vr, without spilling charge. Neglecting fringe effects and the 
influence of the depletion capacitance, one expects a linear relationship 
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Fig. 3—Charge-handling ability as a function of applied pulse amplitudes for 
various waveforms. 


between the driving pulse amplitude Vp and the signal-handling 
ability: Q = C..:Ve. 

Experimental results derived from the saturation value of the output 
signal show good agreement with this approximation (Fig. 3). For 
ordinary 3-phase operation with square pulses, the slope of the curves 
corresponds to an electrode capacitance of 5.6 X 10- F, which in 
turn corresponds to an area of 50 um? on 3000 A of SiOe. With a 
channel width of 15 um, this yields a calculated effective electrode 
length of 3.3 um, which agrees with the observed length within the 
measurement accuracy of 0.5 um. 

Curves taken at various pulse rates are parallel but do not fall on 
top of each other. Somewhat lower charge handling is observed as the 
frequency increases, and the corresponding extrapolated curves 
cross the abscissa at higher values of Vp. Comparison of measurements 
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taken at a fixed clock rate but with pulses that have a different fall 
time show reduced signal amplitude for the slower pulses. This in- 
dicates that the frequency dependence of the charge-handling ability 
may originate from the degradation of the driving pulses. Stray 
capacitance and the resistance of the diffused crossunders slow down 
the response time and reduce the pulse amplitude at higher frequencies. 

The device was also operated with sine waves and the maximum 
signal charge plotted as a function of peak-to-peak amplitude. The 
expected values in this mode are 75 percent of the square-wave 
operation, since, when one phase is at its maximum value Vp, the two 
neighbors are at a voltage Vp(1 + cos 120°)/2 = 0.25 Vp. Experi- 
mentally observed values fall well upon the calculated straight line 
through the origin. These measurements show a much smaller fre- 
quency dependence. 

A 3-phase device can also be operated in an asymmetric 2-phase 
mode by leaving one set of electrodes at an intermediate de potential 
V.. Using a simplistic model that neglects fringe effects, one would 
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Fig. 4—Charge-handling ability in asymmetric 2-phase mode of operation (see 
inset) as a function of the dc potential of the static phase for various pulse amplitudes. 
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expect that the largest signal current could be carried when this 
intermediate potential V2 is at half the pulse voltage Vp, and that for 
this case the signal handling should be half that of the normal 3-phase 
operation. 

Experimentally, it is seen that for all measured values of Vp, the 
maximum signal current is reached for a de potential V2 equal to about 
one-third of Vp (Fig. 4). For this case, the charge-handling ability 
is about 40 percent of that observed in normal 3-phase operation. This 
is due in part to the fact that for higher values of V> the transfer 
efficiency decreases, owing to the formation of insuppressible barriers 
between the de phase and an adjacent phase that is fully turned on. 
Also, in this structure that has gap widths comparable to the electrode 
lengths, the gaps themselves may play a significant role in the charge- 
storage process. The inset of Fig. 4 illustrates that fact. In this model 
the de phase V2 turned on to Vp/3, together with the two adjacent 
gaps, can store the same amount of charge as the well underneath 
phase 1 or 3 in the asymmetric 2-phase mode. 

The important role of the potential in the gaps is also expressed in a 
strong dependence of the performance on the resting potential Vr. 
The inset of Fig. 5 shows the function of Ve serving as a bias on top 
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Fig. 5—Charge-handling ability as a function of resting potential Vr (see inset for 
definition) for normal 3-phase operation and for the asymmetric 2-phase mode. 
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of which the pulse potentials and, if applicable, V, are floating. Figure 5 
then shows the dependence of the charge-handling ability on the rest 
potential Vz. In the normal 3-phase mode, the operating range of Vr 
is about 8 V. This is a rather high value for a 3-phase CCD. Typically, 
these devices are much more sensitive to Vz and show ranges of only 
1 to 3 V. The range of Ve increases for higher pulse voltages Vp, be- 
cause the stronger fringe fields can suppress larger barriers in the gap. 
In the asymmetrical 2-phase mode, where the voltage difference on 
adjacent pads is smaller, a smaller operating range of Vz is found. 

Switching all electrodes to the same dc potential allows one to 
operate the device as a long IGFET. Figure 6 shows two sets of curves. 
The dashed lines are the drain current versus gate voltage curves taken 
in a point-by-point measurement starting at low values of Vg. The 
electrodes were then held at a de potential of 30 V for 90 minutes, and 
the measurements were repeated working from high toward low values 
of Vg. The strong hysteresis observed is produced by the slow time con- 
stants involved in charging the gaps to the potential of the electrodes. 

Values for the carrier mobility deduced from steady-state curves 
obtained after the device had been sitting at a certain gate potential 
for a sufficiently long time are in fair agreement with measurements 
taken on ordinary test IGFET’s with continuous gate electrodes. In 
both cases, the values range around 700 cm? V~ s7, 


V. TRANSFER INEFFICIENCY 


5.1 Introduction 


For practically all applications of a charge-coupled device, the most 
crucial parameter of the device is its transfer inefficiency. Figure 7 
shows some calculated output pulse trains, produced in response to 
the injection of single charge packets into the input of devices with 
different overall transfer performance. This computation was done 
using a linear small-signal approximation,® which assumes that in each 
transfer every charge packet leaves a fixed fraction of its charge e 
behind, regardless of signal amplitude or the charge contained in 
previous stations. The overall performance of the device is suitably 
characterized with a “transfer inefficiency product” ne multiplying 
the number of transfers n with the fraction of charge e left behind in 
each transfer. Experimentally, this ne product is determined by com- 
paring the observed output pulse train with calculated model plots. 
For large values of ne the delay of the maximum amplitude of the out- 
put pulse train is measured with respect to the calculated exit time 
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Fig. 6—Operation of the CCD as an IGFET by tying all electrodes together. 
Hysteresis is observed due to leakage of charge from the electrodes into the transfer 
gaps. 


from an ideal device. This delay, expressed in a number of time slots 
given by the inverse of the clock frequency, is numerically equal to the 
transfer inefficiency product ne.® 


5.2 Results 


A linear model describes the actual behavior of a real device im- 
perfectly. Small charge packets following a string of empty buckets 
show the biggest degradation. Figure 8 illustrates the dependence of 
the transfer efficiency on signal amplitude and on background charge. A 
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Fig. 7—Appearance of a single-charge packet at the output after transfer through 
a COD with a total inefficiency product ne (linear model). 


sharp spot of light with a visible diameter of about 2 um was focused 
on the gap between two electrodes 150 transfers from the output end. 
By varying the integration time, the linear amplitude range of the 
device was determined. The output pulse train was then monitored as 
a function of signal level [Fig. 8(a)_]. The inefficiency product decreases 
from 0.8 to 0.5 as the signal is increased from a fraction of 0.5 to 1.0 
of the linear amplitude range, which was defined earlier as the satura- 
tion point. If more charge is injected, it can no longer be held by a 
single potential well. Some charge then overflows into the neighboring 
stations, forward as well as backward. In the output pulse train, some 
charge is observed to come out earlier than the proper time slot. 

The influence of background charge was studied by illuminating the 
device uniformly at various intensities. A signal charge packet cor- 
responding to half a full well was injected with the sharp light spot and 
the output monitored as a function of the amount of background charge 
[Fig. 8(b)]. The infficiency product decreases from 0.8 to 0.4 as the 
background charge is increased from 0 to a fraction of 0.5 of the linear 
range. The first 20 percent of background charge yields the most 
significant improvement in performance. The measurements of Fig. 8 
have been taken at the clock rate of 1 MHz. The behavior is typical 
for frequencies below 2 MHz. 

In most of the following experiments the transfer inefficiency was 
measured by using the device as an analog shift register. Input diode 
and input gate were kept at de potentials such that the diode would 
just trickle a small amount of current into the device and thus provide 
some background charge. Every 256th clock pulse, the input diode was 
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Fig. 8—Experimentally observed signal amplitudes (a) as a function of the total 
charge placed into one packet, (b) as a function of background charge with constant 
input signal charge (1 MHz). 


pulsed to a more negative value for a time interval of about one-third 
of the clocking period and, thus, a well-defined packet of charge was 
injected. The charge packet was due to leave the device 500 clock pulses 
later. At that point, a time mark was generated that served as a refer- 
ence point. 

The signal from the output diode was led into a linear preamplifier 
consisting of a cascode stage with an active load, an emitter follower, 
and a current-feedback branch to the gate of the input J-FET. A set 
of three different amplifiers was used to cover the full range from 1 kHz 
to 17 MHz. The potential of the output diode was normally kept at 
about 10 V and the output gate at a de potential of 2 to 5 V. 
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Figure 9 shows the output from two 500-element devices operated in 
that mode at 1 MHz. Comparison with the calculated plots in Fig. 7 
shows that the two pulse trains correspond to inefficiency products 
of about 0.25 [Fig. 9(a)] and 3.0 [Fig. 9(b)]. However, in both 
cases, the tail of. the pulse train extends much further than in the 
calculated examples. In Fig. 9(a) at least five stations carry an 
observable amount of charge and in Fig. 9(b) the tail extends well 
beyond the range of the picture. This is due to the nonlinear depen- 
dence of transfer efficiency on signal amplitude. Owing to the shape of 
the potential well, small packets lose a larger fraction of their charge 
to the following packets by trapping effects in interface states.? The 
charge packets forming the tail of the pulse train are thus transferred 
less and less efficiently and become more and more delayed. This 
nonlinear behavior is also the reason that the addition of a small 
amount of background charge can drastically improve the performance. 
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Fig. 9—Two experimentally observed signal outputs that illustrate disagreement 
with linear model. 
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The transfer efficiency of the devices studied is a sensitive function 
of most of such operating parameters as waveform, amplitude, and 
bias of the driving pulses. The dependence on signal amplitude and 
background charge has already been mentioned. In the following ex- 
periments, the latter two parameters were adjusted by trial and error 
to give the best results. Background charge ranged typically from 20 to 
50 percent. 

The devices are also sensitive to any changes occurring near the 
surface of the silicon. Some devices have shown strong sensitivity to 
the ambient atmosphere. Breathing slightly onto the device can change 
its performance considerably. A majority of the mounted devices were 
thus covered with a phosphorous glass to protect the bare gaps between 
the electrodes from the influence of the ambient. Though the sensitivity 
to the atmosphere could be strongly reduced by this means, the devices 
still showed a dependence on the history of the investigation. Pro- 
longed operation (over one hour) at high potentials often considerably 
impaired the performance at lower potentials. The devices did, how- 
ever, recover and resume good performance at lower potentials after 
they had been turned off for a few hours. These kinds of instabilities 
can make the experiments very tedious. To reduce their effects on the 
results as much as possible, the measurements have been performed 
first at the lower voltages and then extended to larger pulse amplitudes. 

The quoted inefficiency products refer to the linear model except 
where otherwise stated. This seems to be justified since in the follow- 
ing experiments the emphasis is on the functional dependence rather 
than on absolute values. This approach simplifies interpretation for 
the reader since he can visualize that an inefficiency product ne which 
ranges between 7 andz + 1 means that the 7th station after the proper 
output time slot carries the maximum amount of charge. 

Figure 10 shows the results of an experiment designed to demonstrate 
the time dependence of the performance of a device. After the device 
was turned off for several days, an inefficiency product of 1.8 was 
measured. The device was then completely flooded with current for 
15 minutes by grounding the input diode while the pulses were left 
applied to the electrodes. The input diode was then returned to the 
normal condition and the performance measured at pulse amplitudes 
of 20 V. Figure 10 shows the strong degradation and subsequent 
recovery. A few seconds after the return to measurement conditions 
the ne value was about 20 and then recovered to 2.7 within one hour 
and to 2.4 by the next day. 

It is believed that the change in performance is due to a migration 
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Fig. 10—Strong degradation of performance and subsequent slow recovery intro- 
duced by strong saturation of the transfer channel. 


of charge at the interface between the gate dielectric and the deposited 
protective P glass. The excessive charge in the flooded transfer channel 
represents a ground plane that terminates the field lincs from the 
transfer electrodes. Since this charge sheet extends through the whole 
device, it is also present under the gaps between the electrodes. Field 
lines from the edges of the transfer electrodes have strong lateral 
components that can move charge along the outer surface of the gate 
dielectric and, thus, charge the surface above the gaps more positively. 
This generates potential pockets in the silicon which can trap part of 
the signal charge. In normal operation of the device, the forces that 
charge up the gaps are absent since the charge resides under the gaps 
only for a fraction of a nanosecond. 

The response to saturation is not equally strong in all devices. The 
state of the surface of the gate dielectric before the deposition of the 
protective P glass probably plays an important role. In the following 
measurements, prolonged saturation of the devices was carefully 
avoided. 
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Fig. 11—Transfer inefficiency product as a function of driving pulse amplitude for 
various waveforms at (a) 100 kHz and (b) 1 MHz. 


A set of experiments was performed to determine the best operating 
conditions at a given frequency. Values of ne have been determined as 
a function of driving pulse waveforms, amplitudes, and dc bias. 
Figure 11 shows the dependence on waveform. At lower amplitudes, 
shaped square pulses with slower trailing edges give best results. For 
higher amplitudes, the performance is fairly independent of the driving 
pulse form, including sine waves. 

The strong differences in the performance for the cases of square 
pulses of 10 V amplitude with mutual overlaps of 0 ns and 100 ns 
are further explored in Fig. 12. The pulses are about 300 ns long 
with rise and fall times of about 10 ns. The ne values are measured 
as a function of the overlap of the driving pulses. The results are 
plotted in two different ways. In Fig. 12(a) the overlap in time cor- 
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Fig. 12—Transfer inefficiency product as a function of mutual overlap for square 
driving pulses plotted in two different ways (see inset for definitions). 


responding to the 50-percent point of the waveforms is measured in 
nanoseconds. A negative overlap thus means that the waves cross at 
less than 50 percent of their peak values. The best performance is 
reached for overlaps of more than 10 ns. Figure 12(b) shows the same 
results plotted as a function of the amplitude at the crossing point. 
The best performance is reached for a crossing point higher than 90 
percent of peak amplitude, which corresponds to overlaps of more 
than 10 ns. 

Square pulses with 10-ns overlap have been used to evaluate the 
influence of the resting potential on the performance. In Fig. 13(a), 
the dependence on pulse amplitude is first established. For pulse 
amplitudes higher than about 10 V, which seem to be necessary to over- 
ride some barrier in the bare gaps, performance improves slowly but 
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monotonically toward higher amplitudes. For a pulse amplitude of 
15 V, the influence of the resting potential Vr was studied [Fig 13(b) ]. 
The device is operable in a range of Vz up to about 10 V with the best 
performance near 6 V. Again, it has to be pointed out that such a 
wide operating range has not been observed too often. 

In these conditions in which there are square pulses with 10-ns over- 
lap, the transfer efficiency was measured for clock rates between 1 kHz 
and 17 MHz. At each frequency, the amount of background charge 
and the signal level were adjusted to give the best results. Background 
charge ranged from 15 to 30 percent and signal amplitudes from 30 to 
50 percent of a full bucket. The devices were operated as shift registers 
in the continuous wave mode. Single charge packets were injected every 
256 clock pulses. Over more than three orders of magnitude from 1 
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Fig. 13—Transfer inefficiency product as a function of (a) pulse amplitude and (b) 
resting potential for clock frequencies of 1 MHz and 10 MHz. 
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Fig. 14—Transfer inefficiency product as a function of clock frequency or available 
transfer time. 


kHz to 2 MHz no significant differences in performance were mea- 
surable (Fig. 14). Then, from 2 MHz to 17 MHz the ne value rose from 
about 1.5 to 3.5. This decrease in performance does not stem from the 
free-charge-transfer mechanism itself. The time available per transfer, 
even at the highest frequency where it is 20 ns, is still long compared 
to the calculated transfer time constants, which are well below 1 ns.°® 
An increase in the interface state density toward the conduction band 
edge could account for the decrease in performance at the highest 
frequencies.® Furthermore, above 5 MHz, the driving pulses as ob- 
served on the connector to the device were far from perfect, and addi- 
tional degradation might have been produced by stray capacitance and 
by the diffused crossunders on the device itself. 


5.3 Discussion 


To study the transfer performance of a CCD, an optimum number of 
transfers exist, depending upon the performance of the device. If the 
device is too short, the degradation is too small to be measured ac- 
curately. If the device is too long, the degradation is large and, thus, 
again ne is difficult to measure accurately. Values of ne on the order 
of one are most casily measured. 
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For long devices another problem arises. The measured functional 
dependences might be smeared due to nonuniformities of some physical 
parameters along the transfer channel, such as flat-band voltage or 
electrode width. Figure 15 illustrates the integrated ne values measured 
with a spot of light injected at various distances from the output 
diode. It can be seen that ne is not linearly increasing along the device. 
In some other devices, sharp steps in the ne curves have been observed 
which often could be correlated with an obvious physical defect, such 
as a partly missing transfer electrode. 

The observed inefficiency products rangéd from 0.2 to several hun- 
dred. They are distributed in a log-normal manner around a value of 
20 with a standard deviation of about a factor of five. This spread is 
too high to be explained by variations in interface state densities. These 
devices are, however, very sensitive to variations in the fixed oxide 
charge in the transfer gaps. Too little or too much charge can lead to 
barriers or pockets in the interface potential, both of which strongly 
impair the transfer efficiency of the device. These effects associated 
with the transfer gaps are strong enough to override other parameters 
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Fig. 15—Transfer inefficiency product from various points of the device to the out- 
put. The charge was injected with a light spot. 
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that could influence the measured inefficiency products. A survey of 
the storage times of MOS capacitors, of the average dark currents of 
working devices, and of their ne values was performed on 35 slices in 
dynamic probe tests ; although the observed values for the storage times 
and for the dark currents correlated reasonably well from slice to 
slice, no correlation between ne and either of the other two measure- 
ments could be found. 

The best results observed are ne = 0.2, which corresponds to 
e = 1.3 X 10-4, a value comparable to results reported on 2-phase 
devices." Values between 10-4 and 10-* can be expected with inter- 
face states densities in the low 10!° cm~? eV—!.° The majority of devices, 
however, have e values between 10-* and 10~*. These values lie in the 
range of what has been obtained with ordinary bucket-brigade de- 
vices.!?3 This similarity leads to the suggestion that gapped devices 
often work in a similar mode. Fixed barriers in the gaps keep a reser- 
voir of carriers underneath each electrode. The signal charge modulates 
the barrier height, and transfer efficiencies comparable to bucket- 
brigade structures can be expected. 


VI. DARK CURRENT 


When the linear CCD is used as a line imaging device, one set of 
electrodes is switched to an integration potential V; during the time 
that charge is being integrated. The minority carriers generated by the 
incoming light are then collected by the potential wells underneath 
that set of electrodes. After a sufficient charge pattern has been ac- 
cumulated, the stored information is read out in serial form. 

In the absence of any illumination, minority carriers are still gener- 
ated by thermal effects and are collected in the nearest potential wells. 
The generation rate is not necessarily uniform over the whole device 
and, thus, this dark current can generate a pattern of its own. Figure 16 
shows the readout signal of a fairly nonuniform device after integra- 
tion times of 250 ms and 500 ms where in the latter case the highest 
peaks of the signal have already reached saturation. During readout 
each charge packet picks up a little bit of dark current from all the 
locations it passes on its way to the output diode. From all the other 
locations on the input end of the device, it had already received a dark 
current contribution during the readout of the previous line when 
the “empty” packet was moved from the input diode to the integra- 
tion site. Therefore, the same integral contribution is added to every 
charge packet. This uniform component can be subtracted or, if the 
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Fig. 16—Integrated dark current profile for two different integration times. 


readout time is kept short compared to the integration time, it can be 
neglected. 

For low-light-level-imaging applications, the dark current should be 
as low as possible. Cooling the device is a possible means to reduce 
the thermal carrier generation. For a simple demonstration the device 
was mounted on an open cooling block. Figure 17 shows the results for 
a temperature interval from —20°C to +60°C. As expected, the in- 
efficiency product showed no significant changes except below —15°C 
where the formation of ice degraded the operation of the device. The 
dark current measurements displayed in Fig. 17 were taken near ele- 
ment 150 in the signal shown in Fig. 16. Within the measurement 
accuracy they follow the calculated dependence given by the intrinsic 
carrier density 7. 

There are mainly two mechanisms that produce a dark current 
component which is proportional to n;.!41> The generation current 
arising from bulk states in a 5-um-wide depletion region is on the order 
of 6 nA/cm? for a minority carrier lifetime of 100 us, which typically 
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Fig. 17—Measured inefficiency products (left ordinate) and local dark current 
generation (right ordinate) compared to temperature dependence of intrinsic carrier 
density n; (matched at 20°C). 


can be expected in a good sample. The generation current arising 
from interface states with an estimated density of 2 X 10/cm?/eV 
at midband is on the order of 4.6 nA/cm?. Thus, surface and bulk 
dark current contributions are of the same order of magnitude. The 
shape of the thermal relaxation curve of the MOS capacitor can be 
analyzed to determine which is the dominant component in a partic- 
ular sample.!® In a CCD the situation is complicated by the fact that 
the two components normally do not have the same active generation 
area. While the strong depletion region is mainly localized underneath 
one electrode per element only, a small depletion region extends under- 
neath all electrodes and thus the surface contribution stems from an 
area that is several times larger. 

Several devices showed a fairly uniform dark current background 
on the order of 10 nA/cm2, but superposed by highly localized point 
defects. The temperature dependence of some of these localized 
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generation centers does not show the simple proportionality with n,, 
but has a range of weaker temperature dependences. At a given tem- 
perature, the number of observable current sources as well as their 
strength depends strongly on the electric field, i.e., on the potential V; 
applied to the integrating electrode set. Most defects seem to fill the 
integrating potential well and then stop their activity as would be 
expected from states that are only active in the depleted bulk of the 
silicon substrate. Other defects, however, possibly located at the 
interface, continue to generate minority carriers and fill adjacent wells 
and eventually a whole transfer channel. These observations indicate 
that there is a variety of localized defects. 


VII. LIGHT SENSITIVITY 


The described device can be used in slow-scan-imaging applications 
as a simple line scanner that integrates the information incident upon 
the transfer region itself. The transfer electrodes are opaque and, thus, 
about 50 percent of the light incident on the transfer region is lost. 
The resolution of such a system in the direction of the electronic scan 
has been discussed elsewhere!’ and experimental results presented.‘ 
The resolution in the direction of the mechanical scan depends on the 
effective light-collecting line width of the device. This width has been 
measured by probing the device with a very narrow spot of light 
(approximately 2 um wide) produced by an incandescent bulb from 
which the IR radiation has been filtered out. This spot was moved 
along two different lines across the transfer channel of the device (see 
Fig. 18). Line A lies beside one of the integrating electrodes, and the 
generated charge, thus, will spill mainly into the potential well under- 
neath. Carriers generated deep down in the bulk can reach adjacent 
wells by diffusion. To measure the total amount of charge, the signals 
of the two adjacent stations were added to the main station. To elimi- 
nate effects of transfer inefficiency, the experiment was performed close 
to the output diode. 

In a second experiment, the spot of light was moved along line B 
lying midway between two integrating electrodes. The charge was then 
distributed more or less equally into the two potential wells and the sum 
of the two signals was used in plotting Fig. 18. 

Both experiments yield the same sensitivity across the channel. 
The 50-percent point is about 10 um outside the edge of the transfer 
channel, indicating that the channel stopping diffusion does not provide 
adequate definition of the optical integration region. The equivalent 
line width of this image sensor is, thus, 35 um or about twice as large as 
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Fig. 18—Normalized light sensitivity measured with a narrow light spot in two 
different traces across the transfer channel. 


the length of a CCD element. Such a device should, therefore, give a 
much better resolution in the direction of the electronic scan than in 
the direction of the mechanical scan. This agrees with experimental 
observations. To improve the vertical resolution of this imaging system 
to match the horizontal resolution, the light-sensitive line width would 
have to be confined to about half the element length. 


Vill. VARIATIONS IN TECHNOLOGY 


Devices with different insulator thickness, electrode length, and 
metallization have been built to study the influence of these parameters 
on device behavior. While the effect on signal-handling ability followed 
a predictable pattern, the influence on transfer inefficiency was often 
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concealed by the large spread of the observed values. Still some 
trends became evident. 

If operated at the same potentials, a thinner insulator leads generally 
to a higher signal-handling ability, as expected from the higher capaci- 
tance. When the operating voltages are lowered proportionally to the 
reduced insulator thickness, the devices show poor performance. High 
potential differences between adjacent transfer electrodes are still 
necessary to produce fringe fields that can overcome the fixed potential 
barriers in the gap. With a thinner insulator, the dependence on Vr 
becomes more critical, and the devices show a narrower operating 
range. This trend can be compensated for if, at the same time, the 
gaps are narrowed. With an oxide thickness of 1500 A, 2 um seemed to 
be an appropriate gap width to achieve an average operating range for 
Ver on the order of 1 V. No significant change in behavior could be 
attributed to the replacement of 1500 A of SiO, with a double insulator 
structure consisting of 1200 A SiO, and 500 A Al.Os. 

Three different metallizations have been compared: W, Al, Ti-Pd-Ni. 
No specific difference in performance was observed in devices with 
chemically etched W or Al electrodes. On devices with a Ti-Pd-Ni 
metallization on a Si0,-Al,O; insulator a backsputtering process’ 
was used to obtain the required accuracy in the delineation of the 
transfer electrodes. Figure 19 compares the results of the dynamic 
probe tests on all operating backsputter delineated devices with a 
control batch with chemically etched electrodes on the same double 
insulator. The backsputtered devices showed ne values that were on 
the average about a factor of six higher than the values obtained on 
devices with W or Al electrodes. Among possible causes, differences in 
the interface state density underneath the electrodes and variations in 
gap width due to a possible undercutting of the Ti have been ruled out 
experimentally. Thus, it is conjectured that the backsputter process 
degrades the integrity of the Si-SiO» interface in the region of the gaps 
where the metal is thinned to within 1000 A of the insulator surface 
during backsputtering.!® In spite of the wide spread of values, this 
particular trend in ne was clearly visible, because its origin itself is 
associated with the transfer gaps, which are the single most significant 
cause for high transfer inefficiency. 

To reduce the sensitivity to the ambient, some slices were protected 
with a dielectric level of, for example, 1 um of phosphorous glass or 
1000 A of silicon nitride. While the reaction to such simple tests as 
“breathing onto the device” was strongly reduced, transfer efficiency 
did not improve on the average, nor did the slow instabilities disappear. 
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Fig. 19—Plot of ne distribution for all operating backsputter delineated devices 
and for a control batch with etched electrodes. 


The results generally leave the impression that the condition of the 
surface of the gate insulator in the gaps during the deposition of the 
protective dielectric is crucial to the final performance of the device. 

In another attempt to control the potential in the gaps, six finished 
devices with 1500 A of SiO, and 3 um gaps, which orginally operated 
with ne products of 5 to 10, were overcoated with a strip of poly- 
crystalline silicon, with sheet resistances ranging from 10° to 10” 
ohms/square. In two cases the ne product improved to values around 
two and in one case even below one. The performance of the other 
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three devices worsened and showed some erratic behavior. Still, these 
results suggest that a properly developed resistive sea might possibly 
yield a solution to the gap problem. 


IX. SUMMARY 


Signal-handling ability, transfer efficiency, and dark-current profiles 
of over a hundred individual linear CCD’s have been studied. The 
total spread of the result is surprisingly wide. Many erratic effects have 
been observed but not investigated in detail and, therefore, are not 
fully understood. On several devices which showed reasonably good 
performance and no erratic behavior, a thorough investigation of the 
functional dependence of the performance on various parameters has 
been carried out. These dependences are well understood and can be 
expluined with simple models. Deviations from these simple models 
as well as the limitations in performance seem to be mainly associated 
with the gaps between the transfer electrodes. Poorly controlled surface 
potential can lead to the formation of barriers or pockets that produce 
poor transfer efficiency, slow instabilities, and nonuniformities in 
large devices. 

Three-phase CCD’s with gaps between the electrodes have been an 
invaluable tool for the investigation of charge coupling and for an 
early demonstration of charge-coupled image sensors. But the tech- 
nologies presently used to make these structures do not produce reliable 
devices with consistent performance. With some technological effort 
a solution to the gap problem can probably be found, for instance in 
the application of a resistive sea on the surface of the device. It is 
questionable, however, if it is worthwhile to put such a development 
effort into this structure. In big devices the large number of narrow 
gaps cause a serious reduction in yield. Furthermore, the peripheral 
structures in more complex devices might require more than one level 
of metallization. It seems more advantageous to build the actual CCD 
with overlapping gates and to provide a completely sealed channel. 
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Sampled-Data System Approach to Model 
Time-Division Switches 
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The sampling switch in a time-division switching system is, in general, 
different from the sampler of sampled-data system theory. A general 
approach is developed for characterizing such a switch as an ideal sampler 
plus some modified transfer functions. With this approach, a time-division 
switching circuit containing a sampling switch can be converted easily 
to a typical sampled-data system, and the well-established mathematical 
tools for sampled-data systems, such as the Z-transform, can be applied. 
In addition, a simplified approach is described that will lead to a very 
good approximation of the “‘exact’”’ solution. 


|. INTRODUCTION 


The transfer function approach developed for sampled-data systems 
has proved to be a very powerful tool for analyzing time-division sys- 
tems.!~3 It yields information useful for both analysis and synthesis of 
the system. However, its application is often limited due to the fact 
that the sampling switch in a time-division circuit is different from the 
sampler of sampled-data-system theory. This difference can be seen 
from the fact that the voltage at the output side of a sampler in a 
sampled-data system is always zero between sampling instants, while 
the voltage at the output side of a sampling switch in a time-division 
circuit is not necessarily zero between sampling instants, if, for ex- 
ample, the switch is connected to a capacitor. As a consequence, one 
cannot treat a time-division circuit as a sampled-data system unless 
the sampling switch can be modeled by a sampler plus a modified 
system-transfer function. 

Of the few who have worked on time-division-system analysis,’ ° 
only Desoer! has come close to using functional blocks to model a 
sampling switch, but no general approach has been developed. It is 
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the purpose of this paper to present a general approach for solving this 
problem. With this general approach, any time-division circuit con- 
taining a sampling switch can be converted to a typical sampled-data 
system, and the well-established mathematical tools for sampled-data 
systems, such as the Z-transform, can be applied. 


ll. FORMULATION 


In a sampled-data system, the sampled signal is related to the original 
signal by a sampling device such as is shown in Fig. 1. The output of 
the samples is a train of amplitude-modulated pulses. The interval T 
between the consecutive pulses is called the sampling period, and the 
pulse width p is referred to as the sampling duration. In the ideal case, 
we assume that the sampler operates in zero time so that the pulse 
width p is equal to zero. Then the output of an ideal sampler is a train 
of amplitude-modulated impulses and is related to the input by 


v*(t) = > v(nT)3(t — nT), (1) 
where 6 is the Dirac Delta function. We note that whether the operating 
time of the sampler is zero or not, the sampled voltage is always zero 
between samplings. 

A switch operating periodically in a time-division system is not 
equivalent to a sampler in a sampled-data system, because the signal 
at the output side of a switch is not necessarily zero between samplings. 
However, if an ideal amplifier with zero output and infinite input 
impedances is added to the switch, as shown in Fig. 2, then the output 
signal of the amplifier is equal to zero between samplings. In fact, if 
the sampling duration is much shorter than the sampling period, then 
an input-output signal relation identical to (1) can be obtained. Thus, 
if the switch in a time-division circuit is followed by an amplifier, then 


v(t) v*(t) 


v*(t)=0 
FOR (n—1) T+p<t<nT 
n=1,2,3,-""° 
Pp T \2T 
Ttp 


T-SAMPLING DEVICE 
fate od 


1 ! v*(t) 

(aa | 

CLOSE FOR p SECOND 
EVERY T SECOND 


Fig. 1—Sampling device. 
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IDEAL AMPLIFIER 
SWITCH v*(t) 


v(t) LOAD 


Fig 2—Time-division switch followed by an ideal amplifier. 


the sampled-data system techniques are directly applicable. In practice, 
this is not always the case. Sampling in a time-division system is 
frequently performed by a switch connected directly to a time-division 
bus. Our objective is to characterize the switch by a sampler plus some 
modified system-transfer function so that any time-division system 
employing periodic sampling can be treated as a standard sampled- 
data system. 

In general, the time-division system we are interested in has the form 
shown in Fig. 3. It consists of two networks connected by a switch in 
series with some finite impedance Z,. The switch is closed periodically 
for a brief interval of p seconds every T seconds. The smallest time 
constant of the input signal and the sampling period T are both much 
greater than p. Referring to Fig. 3, we define v12(¢) as the difference 
between v(t) and v2(t), the voltages at terminals 1 and 2, respectively ; 
Voe(t) is defined as the open circuit Thevenin equivalent voltage at 
terminal 1 and z(t) as the current in the switch. The current 7(t) can 
be found from the equivalent circuit (Fig. 4) obtained by connecting 
the driving source e(¢) = v.-(t) in series with the time-division switch 
and impedances Zo, Z1, and Zz, where Z; and Z, are the output im- 
pedance of network 1 and input impedance of network 2, respectively. 
As the switch is closed only for a time interval from ¢ = nT to 


Sail ——- >| 


SWITCH 







oe NETWORK 1 NETWORK 2 


Fig. 3—General time-division system. 
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SWITCH 





Fig. 4—Equivalent circuit for solving for g(t). 


t=nT + p,n=0, 1, 2, ---, we may express 7(é) as 


Specs, | bale) nT StsnT +p 
ie {5 otherwise (2) 
or 
i(t) = bin Lu — nT) — ult — nT — p)], (3) 


where u(t) is the unit step-function, T is the sampling period, and p 
is the sampling duration. 

To solve for 7,(¢), we let the switch in the equivalent circuit (Fig. 4) 
close only for a time interval from ¢ = nT tot = nT + p. The driving 
source e(t) should be modified to vo.(t) — v10(nT-) — v20(nT-), where 


Vi0(t) = Vo-(t) — v1 (t) = 2(t) 02, (t) 
Ve0(t) = U2 (t) = 1(t) O22(t) 


and o denotes the convolution product. Note that vio(n7-) and 
veo(n7'-) are the voltages across Z; and Z, just before the switch is 
closed. For the small time interval nT < t < nT + 2, Voc (t) & voe(nT).” 
Therefore, e(t) ~ v..(nT) — vio(nT-) — veo(nT-). Defining* 


va(b) Voc(t) = V10(t-) = Vo9(t~) 
Voc(t) = Vr0(t =. €) =— Voo(t —= €), (5) 


where e > 0 is an arbitrarily small ae the current n7(t) can be 
expressed as 


(4) 


I 


ll 


in(t) = 7 poo Y (8)+e eee, (6) 


where Y(S) = 1/[Zo(S) + Zi(S) + Z2(S)] is the admittance func- 
tion of the equivalent circuit and £—' denotes inverse Laplace trans- 


*From (5) we note that at the sample instant t= (nT), va(nT’) = v1(nT-) — 02(nT-) 
for n = 1, and v2(0) = v,.(0) = 0 for a physically realizable system. Hence, va(nT’) 
= u12(nT~), the difference between »,(t) and v2(t) just before the switch is closed. 
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formation. Substituting (6) into (8), we have 


a(t) = > va(nT) £ E Y (8S) ae 
-Tu(t — nT) — u(t — nT — p)]. (7) 


Equation (7) suggests that we may characterize the switch by a 
sampler plus a transfer function G(S), as shown in Fig. 5, to relate va 
and 7 by 

(8) = Va(S8)G@(8), (8) 


where I(S), Vz(S), and G(S) are the Laplace transforms of i(t), 
va(t), and g(t), respectively (similar notations will be used hereafter 
without explanation). By the definition of 1mpulse-modulated signal, 


va(t) = ¥ va(nT)8(t — nT). (9) 


n=0 

In the time domain, eq. (8) corresponds to the convolution integral 
t 

i(t) = ) vi(r)g(t — r)dr. (10) 
0 


Substituting (9) into (10) and integrating we have 
i(t) = ¥ va(nT)g(t — nT)u(t — nT). (11) 
n=0 
Comparing (7) and (11), we have 


gu) = 2 | §-¥(S)]-Lu®) — wep) 2) 


and 
es) -s2{e|evO]-tuo-wt-pl}, a9) 


where £ is the Laplace transformation operator. Equation (13) yields 
the transfer function we need to characterize the switch. Note that 
the function £—[(1/S)Y(s)] is the current i(¢) in Fig. 4 with a unit 
de driving source. Once G(S) is found, a functional block diagram 


SAMPLER 
G(S) = L[g(t)] 


Fig. 5—Characterization of the switch. 
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[ eres! 
DELAY ELEMENT 


e— or 


Fig. 6—Transfer function block diagram between V,. and V2. 


describing the signal flow from network 1 to network 2 can be con- 
structed easily, as shown in Fig. 6. Now the system in Fig. 3 is con- 
verted to a standard sampled-data system. 

The transfer function from v,, to ve can be obtained from Fig. 6: 


Va(S) = Voe(S) — Vio(S)e~*’ — Vao(S)e7*s 
= V,.(8S) — I(S)[Zi(S) + Z2(S) Je~*S 
= Voe(S) _ Va(S)G(S)[Z1(S) -}- Z2(S) jes 


Va(S) = Vae(S) — Va(S)[@Zi(S)e-* + GZ2(S)e“]* 


or 
eee ee 
~ 1+ [4Z,(S)e-8 + GZ.e- ]*? 
where GZ;(S) = G(S)Z;(S) and 7 = 1, 2, and e > 0 is arbitrarily 
small. From (14), 
V2(S) = I(S)Z2(S) = Va(S)G(S)Z2(S) 
_ GZ_(S) 
~ 1+ [GZ,(S)e— + GZ,(S)e*5 }* 
To find [GZ,(S)e-*5]*, we need to know the relationship between 

gz,(t) and [gz;(¢ — e)]*. The function g(é) is defined in (12): 


Va(S) (14) 





Voe(S). (15) 





. ists 
00 = ©") srr ERT ET | MO ~ MO~ 7 


A(t)u(t) — h(u(t — p), 


where 


SS a a Oe (F 
h(t) = & | Stas + Z;(S) + Z| 


Note that h(¢) is the step-response of a linear passive network and, thus, 
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is continuous for all t > 0. Now 
t 
geilt) =f g(a)edlt — nar 


= [ h(r)zi(t — r)u(r)dr -f[ h(r)as(t — r)u(r — p)dr. 


From the convolution of two continuous time functions, gz, (é) will be 
continuous for all 0 <t< p and ¢> p. Since p < T’, gz:(t) will be 
continuous at t = nT for all n = 1, ie., 


gei(nT — e) = gz:(nT) (16) 
as « approaches zero. At t = 0, since gz,(¢) = 0 for all t < 0, we have 
gzi(0 — «) £0. (17) 
From (16) and (17), 
> gei(nT — «)d(t — nT) = 5 gzi(nT)d(t — nT) 
n=0 n=1 


ive] 


= > gzi(nT)6(t — nT) — gz:(0)6 (2) 


n=0 


for arbitrarily small «. Therefore, 





[gzi(t — ¢)]* = gzi(t) — gzi(0)6(2) (18) 
and 
[GZ;(S)e-*S]}* = GZ;(S) — gz.(0). (19) 
From (15) and (19), we have: 
Voe(S) 1 — gei(0) — ge2(0) + GZ;(S) + GZ2(8) 
and 


V3(8) _ AC) | 
Vi-(S) 1 — gzi(0) — gze(0) + GZi(S) + GZ9(S) 


If we are interested in the transfer function V2(S)/Vi(S), then since 
Vi(S) = V.o.(S) — I(S)Z1(8), we have: 


Vi(S) = Voe(S) — Va(S)GZ1(S). (22) 
Substituting (14) into (22), 


1 + GZz(S) — gzi(0) — gz2(0) ; 
1 + GZ{(8) + @Z2(8) — gzi(0) — gz2(0) 


(21) 


Vi(S) = Vo-(S) (23) 


TIME-DIVISION SWITCHES 617 


From (20), (21), and (28), we have: 


Vi(S) 1 — gai) — gz2(0) + GZ;(8) 


and 
Vi(S) _ G23 (8) | 
Vi(S) 1 — gei(0) — gz2(0) + GZ2(8) 
Since gz;(0) = lims.. S:GZ;(S), it will be equal to zero when the func- 


tion GZ;(S) has at least two more poles then zeros. In such cases, (20), 
(21), (24), and (25) become 





(25) 








V8) G24(8) 

Vi(8) — 1+ G2i(S) + GZS) oe 
Vi(S) _ GZS) 

Vi(S) 1 +GR(8) + O28) ey 
V8) GZx(8) 

Vi(S) 1+ GZi(S) ees) 
VilS) _  G2%(8) on 





Vi(S) 1+ GZ3(S)_ 


As a simple example, let us refer to Fig. 7, which shows an ideal 
sample-and-hold circuit with a capacitor C. For this circuit Z)>= Z1=0, 
= 1/CS, and it can easily be found that G(S) = C. Therefore, 


G(S)Z.(8) = 3 (26) 


Since it has only one more pole than it has zeros, eq. (24) should be 
used to find V2(S8)/V;(S): 
V2(8) = 1/S ey oes aes (27) 
Vi(S) [1 — u(0)] + L1/S} s 
The general approach used to characterize the time-division switch 
can be summarized as follows: 


(2) Form an equivalent circuit (Fig. 4) by using a unit step 
voltage source to drive the impedances Zo, Zi, and Z2 con- 
nected in series, where Z; is the output impedance of N1 and 
Z, is the input impedance of N2. 

(it) Solve for the current 7(¢) in the equivalent circuit. 

(iit) Let g(t) = 7()[u() — u(t — p)j. Calculate G(S) = L{g(t)} 
and GZ;(S) = G(S)Z,(S8), 7 = 1, 2. 
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SWITCH 


v(t) Cc 


Fig. 7—Ideal sample-and-hold circuit. 


(iv) Now the energy transfer between N1 and N2 can be described 
by a sampler plus some transfer functions. Either one of the 
following formulas may be used: 





Voe(S) 1 — gai(0) — gz2(0) + GZi(S) + GZ2(S) 


Vi(S) 1 — ger(0) — gz2(0) + GZ3(S)’ 


where v,. is the open circuit voltage at terminal 1 in Fig. 3. When the 
function GZ;(S) has at least two more poles than zeros, we know 
immediately that gz:(0) = 0 and can be removed from the above 
formulas. 


lil. AN APPLICATION 


The switch we model here is a practically realizable sample-and-hold 
switch for a time-division switching system. It is shown in Fig. 8. The 
series resistor R represents the gate resistance during sampling. The 
series inductor represents the lead inductance whose value depends 
on the bus structure. 

For this circuit, Z; = 0, Z. = 1/CS, and Z) = SL + R. Therefore, 
g(t) can be found by solving a simple series RLC circuit with unit de 
input and a switch closed at ¢ = 0 and open at t = p for the rest of 
the time. The result is 


1 


g(t) = Sha (e~At — oP) Tut) —uli— p)], (30) 


SWITCH R L 


GATE RESISTANCE R=302 
(t) LEAD INDUCTANCE L 
v 


c SAMPLE-AND-HOLD CAPACITANCE C= 1000pF 
SAMPLING PERIOD T = 83.3 us 
SAMPLING INTERVAL p = 300ns 


Fig. 8—Practical sample-and-hold switch. 
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where a= R/2L, wo = 1/VLC, Bi = a — Vo® — w2, and B, =a 
+a? — w2. From (80) we have: 


Aes 1 1 — e-?(St81) = | — gop (St+62) di 
(= | S+ 61 S + Be on 
The transfer function H(S) = V2(S)/V;i(S) can now be found from 

H(8) = Vo(S) _ G(S)Z2(S) (32) 





Vi(S) 1+ [G(S)Z.(S8)}* 


The calculation of G(S)Z2(S) and [G(S)Z.(S) ]* is given in the ap- 
pendix. With G(S)Z.(S) and [G(S)Z.(S) ]* known, we have 


H(S) _ 1- oe BiBo k — e7P (S+81) = b= e~P (S+82) 





S Bi — Bs S + B1 S + Be 
1 
T+ ets (8) 
where 
Bye 82”? = Boe 41P 
ee 4 
: Ba Bi 7) 
From (88), 
V2(S) 
H*(S) = = 
am aE 
= 1 
= l—erTs 1 
ters TF 
_ 1+k 
— FETR ey) 
When the driving function is sinusoidal, we have: 
: 1+k 
H* (jw) = eer Lk (36) 
: 1l+k 
H* gah thee Se eee a7 
ee NS as kes al oe 


This shows that, for k ~ 0, the magnitude of the voltage gain at the 
sampling instant will be a function of frequency. The maximum (or 
minimum, depending upon the sign of k*) occurs at w7' = 7a, i.e., the 


*It can be easily verified from (34) that k will be positive only if the equivalent 
RLC circuit is under damped. 
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of 
ares: 5 
OUTPUT SIGNAL jag seme ee 
te NUT SIGNAL 


ae 





f = fs/4 f = fs/2 


Fig. 9—Variation of magnitude of output signal at sampling gate with respect to 
input frequency f. 


half sampling frequency: 


ees ee ee 
| H (70). | axtreing i [1 == k| (38) 


Figure 9 shows the laboratory observation of such an effect for k ~ §. 
When the lead inductance is negligibly small, ie, Z—0, then 
B.2— © and 6, > 1/RC. Equations (83), (35), and (37) now become: 


1] — ets a 1 — e74Pe—pS 


WS) = SSF al— eters ee 

1 — ea 
H*(8) = 3 ——— (40) 
[H*(Jo)| = =a (41) 


V1 + 6-297 — 2e-4? cos wT’ 
where a = 1/RC. 
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In this case, the RLC-series circuit reduces to an RC circuit. In a 
practical time-division-switching system, the typical values might be: 
sampling period T = 83.3 us (sampling at 12 kHz), sampling duration 
p = 300 ns, hold-capacitor C = 1000 pF, and gate resistance R = 30. 
A simple calculation will show that 

p _ 300X10° _ 


=e 30x 102 2% (42) 


Hence, e~?? = e~!° ~ Oand the transfer function H(S) in (89) becomes 


1 
1—eTS RC 
H(§) ~—7— (43) 


which indicates that the switch-and-hold circuit can-be approximately 
considered as an ideal sample-and-hold device in series with an RC 
circuit, as shown in Fig. 10. 

It is also interesting to note that if both the gate resistance R and 
the lead inductance L approach 0, we will have an ideal sample-and- 
hold switch. From (43), we can see that H(S) will approach the ideal 
sample-and-hold transfer function 1 — e—75/S as expected. 


IV. AN APPROXIMATION 


In this section, we shall present a simplified approach which in 
general leads to a very good approximation of the results found by the 
general approach described in Section III. The basic idea here is to 
approximate the current 7(¢) in the switch by an impulse-modulated 
signal, 

i(t) ~ 7(2), (44) 


and characterize the switch by the energy transfer during the sampling 
duration: 


ve(nTt) = ve(nT-) + yLo1(nT-) — v2(nT-)], (45) 
IDEAL IDEAL 
SWITCH AMPLIFIER 
R 
Vit) Cc Cc 


Fig. 10—Equivalent circuit for Fig. 8 when L approaches 0. 
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where nT represents the instant just before the switch is closed, and 
nT* the instant just after the switch is reopened. The determination of 
y, which is related to the transfer loss, will be discussed later. From 
(45) we have: 


v(t) = y[nlt — 6) * + (1 — y)[r(t — &) }* (46) 
and‘ 
V2(S) = y[VilS)e“S ]* + (1 — y)[V2(S)e-8}*, (47) 
where e is arbitrarily small. 
Substituting 
Vi(S) = Voe(S) — I*(8)Z1(8) 
and 


V.(S) = I*(8)Z2(S) 
into (47), we have 


I*(8)Z3(S) = y[Voe(S)e-8 ]* + I*(S){—yLZi (Ses * 
+ (1 — y)[Z2(S)e“S]}*}. (48) 


As V,-(t) is continuous for all t = 0, and z;(¢) is continuous for all 
t> 0, 
I*(S8)Z3(8) = yVo.(8S) + I*(S){-v[Zi(8) — 21(0)] . 

+ (1 — y)[Z2(S8) — 22(0)]} 














or 
ie. yVoe(S) . 
PO) TS ta@i a0) @= neo. 
Therefore, 
V.(S) _ I*(8)Z2(S) 
VielS) Woe) 
a yZ(8) (50) 
yLZi(S) + Z3(8)] — ya1(0) — (y — 1)z2(0)’ 
and 
Vi(S) _ T*(S)Z2(S) 
Vi(S) — Vae(S) — I*(S)Z7 (8) 
yZ2(S) (51) 


~ yBi(8) — yai(0) — (y — 1z2(0) 


tT Note that as v;(t) may not be continuous at t = nT, n = 0,1, -«-, 
[Vi(S)e-S]* # Vi(S) — 10). 
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To determine the constant y, we note that the change in v, during 
the sampling duration is a function of the current in the sampling 
switch : 

Ave(nT) = ve(nT*) — v2(nT-) 


in (t) -2o(t) |tanr+p- (52) 


I 


As mentioned earlier, 7,(¢) can be solved from Fig. 4 with the driving 
source e(¢) = va(nT) and the switch closed at t = nT. Since v,,(t) is 
continuous for all ¢ 20, va(nT) = vx(nT-) = 1(nT-) — v2(nT-). 
Hence, in the equivalent circuit of Fig. 4, we let e(t) = vie(nT-) and 
close the switch at ¢ = nT’. Solving this circuit, the current will be 
in(t) and the voltage across Z, at t = nT + p will be Avo(nT); ie., 


Av(nT) = £7 {2 | iF) ¥ (8) --*88.Z2(8) || ae (53) 


where Y(S) = 1/[Zo(S) + Z1(S) + Zo(S)]. From (53), 


zl, Av2(nT) 
a Vie (nT) 





ll 


eo {© [ gY-e-78- 2415) |} 


t=nT+p 


ea {© E ¥(S)-Z2(8) |} : (54) 
Hence, in Fig. 4, if we let e(¢) = u(t) and close the switch at ¢ = 0, 
then the voltage across Z, at t = p will be the value of y. After y is 
found, either (50) or (51) may be used to enable us to replace the 
switch by an ideal sampler plus a transfer function, as shown in 
Fig. 11. 

To illustrate how this approach works, let us return to the practical 
sample-and-hold switch in Fig. 8. As stated in the last section, Z, = 0, 
Zo = R+ SL, and Z, = 1/CS. Solving the series RLC circuit with the 


eee oy 


Voc | Voc* 4 Vo H(S) = 18) 
O S ya (tetay ok CGAP ee 
iene 3s is) Y[24(8}+Z3(8)] —¥2,(0)-(7-1) 2510) 
is ee ed 


Fig. 11—Transfer function diagram for the approximate approach. 
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driving source of u(t), the current z(t) is found as shown in (30): 





e 
, = —_ (efit — e—frt 
Z(t) 2D a2 — w2 (e ; é De (55) 
Now y, the voltage across Z2 at t = p, can be found: 
ye ae 
= f° iolat 
B1B2 1 = 1 
= — (1 — e417) — — (1 — eben 
aos | a! on) B, | 7 | 
=1+k, (56) 


where 
Bye 82? — Boe-Air 


Bo — Br 


is the same k given by (84) in the last section. Therefore, 


k 


1 
Vi(S) 1 |*_&k 
a+e| ae | -§ 
l1—eT 1+k 


= 3 TF ke (67) 


(8) = 


We now want to show that H(S) of (57) is a good approximation of 
H(S8) of (33). From (33), we have: 





1- —TS F S 
H(S) = Se (58) 
where 
— _fiBs 3 SS | 
FS) = gees | S+ Bi S + Be (59) 


To show that H(S) ~ A(S), we want to show that F(S) ~1 + k. 
Since p is small, we have: 


2 2 
R(S) = PO |p - FS +6) p+ 5S +H) 


_ Bibs 
= ee? 
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and 


Pa eS ae eS ee 


Bo — Bi Bo — Bi 


~_} _ Bip? ] _ 3p? 
=e ep, E | xp + | By | exp 7 || 





Hence, ’(S) ~1+k and H(S) ~ A(S). 

Finally, we note that as Zo approaches zero, the current z(t) does 
approach an impulse-modulated function. Thus, the approach de- 
scribed in this section will always lead to the true answer when Zp = 0. 
For example, when R, L — 0, the switch we modeled above becomes 
the ideal sample-and-hold switch. In this case k > 0 and A(S) of (57) 
approaches 1 — e—7S/S, the familiar ideal sample-and-hold transfer 
function. It can also be easily seen that if we start with this ideal 
sample-and-hold switch, i.e., Zo = Zi = Oand Z, = 1/CS, then y = 1 
and Z3(S) = 1/C(1 — e-TS). From (51), we shall again have 
V2(S8)/Vi(S) = 1 — e-7S/S as expected. 
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APPENDIX 

Calculation of G(S)Z.(S) and [G(S)Z.(S)] * 

Since Z.(S) = 1/CS, 61 =a — Vo? — we, Be = a+ Vo? — w?, and 

we = 1/LC, we have: 
1 1 1 

GZ(S) = G(S)Z2(8) = LC 2\q Gt aS 


| 1 — e-P(St61) «| — ep (S+82) 











Sth £Sth 
— BiB2 Lf 1 — er St8 1 — eS +82) (60) 
se eee in| 
and 
aca) — _ 2182 2 (-1)" 
cae Bo — Ba | Pa Bi; 
1 e P88) 1 e-P(S+Bi) 7 * 
Slee ea aap, | ' 
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Hence, 


B18 2 (—1)*# 1 e fipe—TS 
ams) = Pl GE | en en 


to = eS e-or 1 — e-?Se-biT 


acs eet aie | 





Bo — Brim B; L—e¢7 
2 —BrB2 | ets n| 1 — e-Aip = 1l- = 








Bo — Bi 1 — e785 Bi Be 
1 =< 
m: ar amar =. = € pos i= B+ Bye7 82? = B2e—A1” | 
e7 Ts 
where 
4 tem gee o 
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An Optical Repeater With High- 
Impedance Input Amplifier 


By J. E. GOELL 
(Manuscript received September 14, 1973) 


A 6.8-Mb/s repeater for fiber optic communication systems is described 
which incorporates a high-impedance input amplifier. It ts shown that 
by utilizing an input circuit with a time constant which ts long compared 
to the bit interval and equalizing after the signal has been sufficiently 
amplified to set the signal-to-noise ratio, thermal noise can be decreased. 
As a result, a reduction can be realized in the required signal and, with 
an avalanche detector, in the optimum gain. 

The repeater, which was realized in a compact form employing standard 
integrated circuits, utilizes a GaAs light-emitting diode as its optical 
source. Other features include automatic gain and threshold controls and 
recovered timing. 


l. INTRODUCTION 


Digital communication systems utilizing low-loss optical fibers are 
presently being investigated. The realization of fibers with losses as 
low as 4 dB/km! has opened the way for numerous applications. 
System configurations will depend on such factors as fiber dispersion, 
fiber cost, desired information capacity, and terminal costs. Trans- 
mission rates near 6.3 Mb/s are attractive because fiber group delay 
dispersion is not expected to be a problem even with an incoherent 
source, and a wide variety of low-cost integrated circuits are applicable. 
If, as now appears likely, the fiber cost is low, then space multiplex 
could be an attractive alternative to time multiplex in achieving high 
capacity. 

The repeater described here incorporates a high-impedance input 
amplifier which is similar in approach to ones that have been used for 
other applications incorporating a capacitive detector such as nuclear 
particle counters? and television cameras.’ As a result of the high-input 
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impedance, the power required to achieve a specified error rate is re- 
duced, as is the optimum gain if an avalanche detector is used. The 
latter advantage is important since it eases the fabrication of the de- 
tector diode and increases its thermal stability. 

The repeater employs return-to-zero pulses with 50-percent duty 
cycle. The only word pattern restriction is that an occasional ‘‘one”’ 
be included so timing can be recovered and signal level can be deter- 
mined. The optical signal is generated by a gallium arsenide light- 
emitting diode (LED) operating at 0.9 wavelength.* Diodes of the 
type employed have been built with output powers of up to about 
5 mW. Optical powers to 1 mW have been coupled from these diodes 
into a fiber with a 0.63 numerical aperture. The repeater was fabricated 
in a compact form using standard integrated circuits. Automatic gain 
and threshold controls were provided so the optical input power could 
vary over a wide range. Clamping was employed to prevent baseline 
wander with an unbalanced data content of the signal, and timing 
was extracted by a phase-locked loop. 


Il. THEORY 


A typical circuit for a photodiode driving an input amplifier is shown 
in Fig. 1(a) and its equivalent circuit in Fig. 1(b). The current gener- 
ator 272 is the photo current. R, is the de return resistor for the detector, 

and 7, is the noise generator associated with it. The capacitor Ca is the 


AMPLIFIER 





IDEAL 
AMPLIFIER 





DIODE | AMPLIFIER 
(b) 
Fig. 1—(a) Input circuit. (b) Equivalent circuit. 
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output capacitance of the diode, C; is the input capacitance to the 
amplifier (excluding feedback effects), and R; is the input resistance of 
the amplifier. The quantities e2, and 73, are the spectral noise densities of 
the series voltage and shunt current generators which characterize the 
noise properties of the amplifier. 

A common approach to circuit design has been to set 


T= RC 
< 1/baud rate, 


where 
R,R; 
Ht 7 R, + R; 
and 
C =Car+'¢; 


to minimize noise while not introducing significant intersymbol 
interference. 

To achieve this, 7, and 7, are often major sources of noise. There- 
fore, from the standpoint of noise, it is preferable to make R, very 
large and employ an amplifier with low 7,,, even if 7 >> baud interval, 
to amplify the signal sufficiently to set the signal-to-noise ratio, and 
then to equalize the resulting distortion to eliminate intersymbol 
interference. 

Two possible limitations to the high-impedance approach exist, both 
related to the low-frequency component of the signal developed across 
the detector. The difference in voltage between a long string of “‘ones”’ 
and a long string of ‘‘zeros’’ is proportional to the de load resistance on 
the diode. Thus, the required dynamic range of the amplifiers preced- 
ing the equalizer increases with increasing R. Furthermore, with an 
avalanche detector this voltage could change the avalanche gain since 
it is in series with the diode bias. These two factors, which increase in 
importance as baud rate decreases, ultimately limit the magnitude of 
the detector load resistance. 

In the remainder of this section the relationships between error rate, 
signal power, and the circuit parameters are discussed for a binary 
signal with both states equally likely. It is assumed that Gaussian 
noise statistics apply, that dark current is negligible, and that the 
amplifiers preceding the equalizer are linear. 

Personick® has shown that for a pulse of average power p. with 
avalanche gain of mean square (g*), if we assume the optical pulses are 
distinct, the ratio of the pulse peak to the root mean square thermal 
noise in the baseband circuit is equal to the ratio of the average cur- 
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rent of the received pulse to the square root of the quantity 
2 2 
12(p,) = EPG +n, (1) 


where n,, the mean square thermal noise current* weighted to correct 
for input and output pulse shape, is given by 


2 
n= [(e+ 3+ 8) nt Crporan| hi — @) 


The constants are 


T = temperature 
h = Plank’s constant 


v = optical frequency 


» = detector quantum efficiency 
= electron charge 
fo = bit rate. 


The weighting functions 7; and Jz, which take account of pulse 
shape, are given by 


2 











_ fs [? |Hou(o) 

Lie is H(w) |“ . 
= 1 y Aut (w) ‘2 w 

ts = Of, [ Royo 4) 








where H,,,(w) is the Fourier transform of the output voltage pulse 
shape, H ,(w) is the Fourier transform of the optical power pulse shape, 
and the pulses have been normalized so that the area of the optical 
pulse is unity, as is the magnitude of the output pulse at the center of 
the time slot. The functions J, and J. can be shown to depend only on 
pulse shape relative to the time slot length, not on baud rate. 

The probability of error can be readily derived from eq. (1), assum- 
ing Gaussian noise statistics. For Gaussian-distributed noise, the 
probability that the noise current will exceed a value D is given by 


1 D 
P(D) = 5 erfe (se) ; 
where erfe is the error function complement and J the root mean square 


“Note that amplifier shot noise has been included in 1:. 
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noise current.6 We assume an ideal regenerator where a ‘“one’’ is 


produced if the input exceeds a threshold level D and a “‘zero’”’ other- 
wise. Then if a “‘zero” is transmitted, the condition that the probability 
will be P, that the noise will exceed the decision threshold is given by 


D > QI(0), (5) 
where 


Q = v2 erfe™ (2P.). (6) 


Similarly, so that the probability that the noise will not exceed the 
signal when a “one”’ is transmitted, the expected value of the signal, 
given by Pinaxne(g)/hv, where (g) is the mean avalanche gain and pmax 
the average power for all ‘‘ones,’’ must exceed the threshold by Q 
times the noise current; that is, 


Pmaxne(g) _ D > QL max. (7) 
hy 
From eqs. (1), (5), and (7), the average power required to achieve 
a specified error probability with avalanche gain is given by 


_ ea ees 2 nb (8) 


2n (g’) (ge 


where (g?) is the mean square avalanche gain. (In the case without 
avalanche gain when thermal noise predominates, the first term in the 
bracket of eq. (8) can be neglected.) For avalanche photodiodes, it has 
been found that 





i oa aia (9) 


and for silicon units x = 0.5. 

A value of (g) exists which optimizes performance.’ The value which 

minimizes the power required to achieve a given error rate, found by 
minimizing eq. (8), is given by 

243 ‘ 

dos = Qn” 


At optimal gain the power required to achieve a specified error rate 
is given by 


(10) 





=o ne (11) 
NEGopt 
BhvQ5!3 fin? 

= er ign (12) 
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It is interesting to note that eq. (11) can also be put in the form 


3 
ee mo (13) 


Jopt 





where p’ is the required power without avalanche gain. From eqs. (8), 
(10), and (12), the required power without avalanche gain, the optimum 
avalanche gain, and the required power with optimum avalanche gain 
are proportional to the second, third, and sixth roots of the thermal 
noise, respectively. 

The series noise source of a junction field effect transistor (FET) is 
virtually independent of the circuit parameters in the normal range of 
operation, and the shunt noise source is negligible at 6.8 Mb/s without 
input tuning. For an FET® 

ee SY kT ( of ) ; (14) 
Ym 
where gm is the transconductance of the device, so the thermal noise 
referred to the detector is 


2 
ners KT fo | ( Fe + om) te a], (15) 





assuming all the noise is due to the first stage of the input amplifier. 
For a good FET the input resistance is virtually infinite, so 


R= R,. 


In a common-source configuration, C is the sum of the drain-gate, 
gate-source, diode, and gate wiring capacitances. 


Ill. CIRCUITRY 


Figure 2 is a block diagram of the repeater, which was constructed 
in a 54 X 4 X 13 inch enclosure. The main signal path is represented 
by heavy lines and boxes. The signal, which is detected by either a PIN 
or a silicon avalanche photodiode, is first amplified by a high-impedance 
input amplifier. Following this, additional gain is provided by an 
SN52733 integrated video amplifier. Next, the signal is equalized, then 
further amplified by another §SN52733 video amplifier and filtered by 
a single-section maximally flat LC filter with a 7-MHz bandwidth 
which, in combination with the other amplifiers, gives a 3-dB point of 
about 6.3 MHz. From this point the signal is fed to the timing circuits, 
the automatic gain and threshold circuits, and the regenerator. Finally, 
the regenerated signal is amplified and applied to the LED. 
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Fig. 2—6.3 Mb/s repeater. 


3.1 Input amplifier 


A 3-stage input amplifier was employed, as shown in Fig. 3. The 
amplifier consists of a 2N4416 junction field effect transistor followed 
by a 3N159 tetrode amplifier, and finally a 2N4416 in a source follower 
configuration. It was found experimentally that input amplifiers with 
a 2N4416 input stage had an input noise equivalent power about 1 dB 
less than with the 3N159. However, the tetrode can provide more gain 
for a single stage because of its low drain to first gate capacitance. 
Furthermore, the tetrode is well suited to automatic gain control since 
the gm of the device is highly dependent on the voltage applied to the 
second gate. Thus, the configuration of Fig. 3 was chosen. It was found 
that the input noise dropped by 6 dB when the first gate of the tetrode 
was shorted to ground, so 75 percent of the thermal noise originates in 
the first stage. Thus, this configuration is very close to optimum. The 
source follower was provided to decouple the input amplifier from the 
subsequent circuits. 


3.2 Equalization 


In order to compensate for the distortion introduced by the long 
input time constant, the circuit of Fig. 4 was employed. With a source 
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—90V AGC +35V 





Fig. 3—Input amplifier. 


of resistance R, and a load whose resistance is included in Re, it can be 
shown that the transfer function has a pole at 


ae 1 Ry 
oe op (3 v Ree zs) 
and a zero at 
Rea ed 
= CiRy 


Thus, the position of the zero can be adjusted by varying either C: or 
R, and, as long as 
Ry > Rs + Re, 


the pole will be above the band of interest and have a negligible effect. 


3.3 Clamping and Peak Detection 


Since the amplifiers of the repeater are ac-coupled, the dc level 
would be a function of word pattern unless suitable provisions are made. 





Fig. 4—RC equalizer. 
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5 CLAMPED OUTPUT 





Fig. 5—Clamp and peak detector. 


In addition, provision to measure signal level must be incorporated to 
set the threshold and to control gain. Clamping and peak detection 
were employed to solve these problems. 

By incorporating both automatic threshold and gain controls, the 
AGC gain need only be high enough to assure that the phase-locked 
loop will function properly and to prevent compression. This reduces 
the tendency toward instability because of a high-gain feedback loop. 

Figure 5 shows the circuit employed. Diode D,; and capacitor C, 
serve as the clamp, and diode Dz and capacitor C2 serve as the peak 
detector. The diodes D3, Ds, and Ds; are included to cancel the diode 
drops of D; and Dy. 


3.4 Digital Circuits 


The timing coincidence and regenerator circuits, Figs. 6 and 7, share 
an SN72810 dual comparator and an §8N7474 dual D flip-flop. The 
comparator has the property 


Vo = a Vi > Ve 
Vo = 0, Vi < Vo, 


where 1 and 0 represent states. For the D flip-flop, the output Q takes 


INPUT v 










OUTPUT 





PEAK 
DETECTOR 
OUTPUT 


Fig. 6—Regenerator. 
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Fig. 7—Timing coincidence. 


on the value applied to the D input when the clock input changes from 
low to high. This value is held until either the clock again shifts from 
low to high or the clear ¢ is returned to ground. The output OQ is the 
complement of Q. 

In the regenerator, Fig. 6, the comparator serves as a quantizer and 
the D flip-flop retimes the signal. On the flip-flop, feedback from Q to ¢ 
is provided so the D flip-flop output will be a pulse of proper duration 
whenever the clock goes positive and the D input is high. . 

The circuit shown in Fig. 7 is used to adjust timing coincidence 
between the phase-locked loop output and the regenerator. The D 
input to the flip-flop is kept high at all times. Adjusting the input to 
V. of the comparator adjusts the delay between the time when a 
positive clock pulse is applied and the voltage of the clear (c) drops to 
a sufficiently low value to clear the flip-flop. The rising edge of the Q 
output is used to trigger the regenerator. 


3.5 LED and driver 


The LED driver consists of a cascade of two emitter followers. The 
driver is capable of generating 1.5-A pulses into a diode load. For the 
tests to be described, the diode was driven with 300-mA peak current 
pulses and generated optical pulses of about 0.3-mW peak power. 


IV. RESULTS 


Both signal-to-noise ratio and error-rate measurements were made 
to evaluate the input amplifier and repeater performance. The 2N4416 
JFET employed for all the tests had an input capacitance of 5 pF 
and gm of 0.006 mho. An additional picofarad of capacitance was added 
by the diode load circuit. The measurements without gain were per- 
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formed with an SGD-040A PIN photodetector which had a capacitance 
of 2 pF and a quantum efficiency of 83 percent. For the measurements 
with gain, a TIXL56 silicon avalanche photodetector was employed. 
This diode had a capacitance of 1 pF and a quantum efficiency of 
55 percent. For both diodes, the noise calculated from the measured 
dark current was negligible. 

The constants J; and J, have been evaluated by Personick® for 
rectangular optical pulses and pulses with a raised cosine spectrum 
and maximum eye opening at the regenerator input. He found J; = 0.6 
and I, = 0.26. The theoretical curves presented here were obtained 
using these values. 

Measurements of noise equivalent power, that is, optical power to 
achieve a unity signal-to-noise ratio at the regenerator, were first made 
to evaluate the performance of the input amplifier. As shown in Fig. 8, 
the results for the input amplifier of Fig. 3 with equalization closely 
approximates those predicted from eq. (8) by setting Q = 1. The 
difference between the theoretical and the experimental curve is 
mainly due to the noise of the second stage, which was about 1 dB 
with the 1-MQ diode load resistor. 


61 





60 


NOISE EQUIVALENT OPTICAL POWER IN —dBm 


1 2 4 6 810 20 40 60 100 200 400 600 1000 
DETECTOR LOAD RESISTANCE IN kQ 


Fig. 8—Noise equivalent power circuit vs. diode load resistance. 
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Other input amplifiers were also tried. It was found that with a com- 
mon-drain first stage and a common-source second stage, with a 
1-MQ diode load resistor, the noise equivalent power was about 1 dB 
higher. With smaller values of diode load resistance, where the resistor 
noise predominates, the common-drain input amplifier’s performance 
was identical to that of the common-source input amplifier. Similar 
results were obtained with a cascade configuration using a bipolar 
transistor for the second stage. 

Figure 9 shows ‘“‘eye’’ diagrams taken at key points in the receiver 
with a 1-MQ diode load resistance. Before equalization, the ‘‘eye’’ is 
fully closed, as is expected. After equalization, the ‘‘eye’’ is almost 
fully open. The regenerated pulse was photographed with recovered 
timing. 

The tracking of the threshold level with peak signal is shown in 
Fig. 10. The tracking in conjunction with the AGC is adequate, as will 
be apparent from the error performance. With germanium or Schottky 
barrier diodes in the clamping and peak detection circuit, the tracking 
could have been held even closer to the ideal, had this been necessary. 
The AGC also functioned properly. The signal level could be held 
within about a 20-percent range with the power up to 10 dB above the 
signal required for 10-° error probability. Greater range could be 
achieved by cascading tetrode FET stages, if required. 





Fig. 9—“Eye” diagrams: (a) Before equalization. (b) After equalization. 
(c) Regenerated pulse. 
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THRESHOLD VOLTAGE 
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Fig. 10—Threshold voltage vs. peak voltage. 


Error probability measurements were made under a variety of con- 
ditions. The results are shown in Fig. 11. The signal source was a 2!°-1 
bit pseudo-random word generator. The signal consisted of 15 bit 
blocks each separated by a zero. The measurements were made with ex- 
ternal timing and performance was optimized at each point, except 
the points indicated with x’s. For these points, which were taken to 
check the performance of the automatic gain and threshold controls, 
as well as the timing recovery, the system was optimized with recovered 
timing and AGC at an error probability of about 10-°. Then all the 
points were taken without further adjustment to the repeater. 

The theoretical curve for a 4-kQ diode load resistor is shown along 
with the measured points. These points were taken with the source- 
follower input amplifier because this amplifier introduced less inter- 
symbol interference with the 4-kQ diode load resistor. The improvement 
with a 1-MQ diode load resistor over a 4-kQ one was about 8 dB, which 
is In agreement with the theory. 

Measurements of repeater performance were made with a TIXL56 
silicon avalanche photodiode. This diode exhibits a significant diffusion 
tail. A second stage of RC equalization was employed to remove the 
resulting intersymbol interference. 
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Fig. 11—Error probability vs. average optical signal power. 


Equation (10) indicates that the optimum avalanche gain is a 
function of error rate. However, since it is not practical to optimize the 
gain at extremely low error probability, the gain was optimized at an 
error probability of 5 X 10-7 and then held constant for all the points. 

Table I shows a comparison of the measured and predicted values of 
Jopt for 4-kQ and 1-MQ diode load resistance. In view of the assumption 
of noise statistics and diode characteristic, the agreement is satisfactory. 


V. CONCLUSIONS 


It has been demonstrated that a simple compact low-cost* repeater 
suitable for fiber optic applications can be built which functions close 
to theory at 6.3 Mb/s. The high-impedance input amplifier and its 
associated equalizer were realized in a straightforward manner, and 
compression did not turn out to be a serious problem. A significant 
reduction in required signal power to achieve a specified error rate with- 


*The cost of the active components was about $30, exclusive of the detector and 
LED. An SGD-040 PIN detector costs about $15 and a TIXL56 avalanche detector 
$65. 
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Table | 


Detector Load Ohms Theoretical Gain Measured Gain 
4K 171 188 
1M 44 62.2 


out avalanche gain was achieved. With an avalanche detector, the 
optimum gain was greatly reduced with high impedance input, as 
predicted. Thus, the temperature stability will be greatly increased and 
the diode fabrication requirements eased with an avalanche detector. 
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Slab-Coupled Waveguides 


By E. A. J. MARCATILI 
(Manuscript received October 16, 1973) 


The slab-coupled waveguide, consisting of a dielectric rod lying on a 
slab that in turn covers a substrate, is a multidielectric waveguide that 
includes such special cases as the single-material fiber, the rib waveguide, 
and the strip-loaded film guide. These guides have recently become known 
as potentially useful either for long-distance optical transmission or for 
antegrated optics. 

Simple, closed-form, approximate solutions have been found to describe 
the following properties of the guide: number of modes, their field con- 
figurations and propagation constants, numerical aperture, requirements 
for single-mode operation, field penetration in the slab, tolerance to curva- 
ture of the guide axis, dispersion, and impulse response. 


|. INTRODUCTION 


Descriptions of three novel dielectric waveguides of wide potential 
use in long-distance optical transmission and in integrated optics have 
appeared in the literature recently. These guides are the single-material 
fiber!? (Fig. 1) made of low-loss undoped fused silica; the rib wave- 
guide’ (Fig. 2) made of two materials, and the strip-loaded film guide*® 
(Fig. 3) made of three materials. In all these fibers n3 is air or an inert 
atmosphere, while in a more general guide it could be another dielectric. 

Although these guides have different shapes and different distribu- 
tions of refractive indices, they have essential elements in common 
that make them close relatives of the same family. A more generic 
member of this family of waveguides (Fig. 4), from which all the others 
can be deduced, is a fiber of arbitrary cross section at a distance / from 
a slab mounted on a substrate. The way in which this guide operates 
is simpler to understand than the others, and is described below. 

The modal spectrum of the fiber (1 = ©) is shown in Fig. 5(a). In 
this example, five modes are guided and their axial propagation 
constants k, lie between kn3 and kn2 where k is the free-space propaga- 
tion constant 27/. Smaller propagation constants than kn; belong to a 
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La] 
20 um 





Fig. 1—Photographs of an experimental (a) multimode single-material fiber and 
(b) single-mode single-material fiber (top), with magnified core region (bottom) 


(n > ns). 


continuum of radiating modes that are unimportant for this discussion. 
On the other hand, the isolated slab (J = ©) supports modes with q 
extrema in the y direction. For simplicity we will assume that the 
field is well confined within the slab. The field components vary 
sinusoidally along 2, y, and z and the respective propagation constants 
kz, ky = wq/t and k, are related by the characteristic equation 


k2n2 — k2 = (3Y. 


Since k, can take any value between zero and infinity, the propagating 





n3 
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Tt 
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Fig. 2—Rib waveguide (n > m and n3). 
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Fig. 3—Optical strip line (n > ni, m2 and ng). 


modes of the slab with one maximum along y, (¢ = 1) constitute a 
continuum with axial propagation constants ranging from zero to 


Qyp2 _ r\ 
ken (4) 


as shown in Fig. 5(b). Similarly, propagating slab modes with two 
extrema along y, (¢q = 2) constitute another continuum with propaga- 
tion constants k, ranging from zero to 


2 
om) 
as shown in Fig. 5(c), and so on. 
Now let us imagine that, as in Fig. 4, the fiber and the slab are 
separated by a finite distance / which is far enough that their respec- 


tive spectra are only slightly perturbed by the coupling. Modes with 
the same propagation constant k, will couple to each other. Therefore, 


~~ 






po 


Fig. 4—Slab-loaded waveguide. 
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WV Kana (3) 


Fig. 5—Modal spectrum in: (a) isolated fiber of Fig. 4; (b) isolated slab of Fig. 4 
(only for modes with one half period across ¢; (c) isolated slab of Fig. 4 (only for 
modes with two half periods across ?). 


modes 1 and 2 of the fiber will remain guided without attenuation, 
though there is indeed an electromagnetic field in the slab that decays 
exponentially in the x direction (Fig. 4) away from the fiber. Mode 3 
will couple to the slab mode with the same k, of the spectrum in Fig. 
5(b); modes 4 and 5 will couple to modes of spectra in Figs. 5(b) and 
5(c), etc. The net result is that modes 3, 4, and 5 of the fiber will be 
attenuated by coupling to slab modes. The smaller the distance | 
between fiber and slab, the tighter the coupling and consequently the 
higher the attenuation of these leaky modes. From Fig. 5 it becomes 
obvious that by adjusting the thickness ¢ of the slab, the number of 
lossless modes can be selected. 

The most important point from this discussion is that only fiber 
modes with axial propagation constants k, larger than the propagation 


constant 
2 
one — | = 
ken ( ; ) 


[Fig. 5(b)] of the slab’s fundamental mode are lossless. . Therefore, 
throughout the paper we will be concerned with the coupling between 
the field in the core’s guide and the fundamental mode of the slab. 

Having identified the three basic elements of this waveguiding struc- 
ture, a slab, a guide (also referred to as a fiber or strip), and the coupling 
between them, we will use the generic name of slab-coupled guides, 
fibers, or strips for all the members of this prolific family. Of course, 
we will reserve the names given by the original authors to identify 
individual guides. 

The general solution of the slab-coupled guide in Fig. 4 would en- 
compass as particular cases those in Figs. 1, 2, and 3. However, only 
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Fig. 6—Slab-coupled waveguide. (a) Original guide. (b) and (c) Equivalent simpler 
guides. (d) Single-material guide. 


two extreme cases seem amenable to closed-form calculations: the 
case of feeble coupling described in Ref. 6 and the case of strong cou- 
pling occurring when the separation between fiber and slab vanishes, 
and which is the subject of this paper. 

In Section II we consider the properties and characteristics of a 
somewhat generalized slab-coupled guide [Fig. 6(a)]. These are the 
following : 


(t) the equivalence to a much simpler guide shown in Fig. 6(c), 
(27) the number of guided modes, 
(ii7) their propagation constants and field configurations, 
(tv) the numerical aperture, 
(v) the design for single-mode operation, 
(vi) the field penetration in the slabs, 
(vit) the tolerance of the guide to the curvature of its axis. 


These general results are applied to the multimode and single-mode 
single-material fibers, rib guides, and strip-loaded guides in Sections 
III, IV, V, and VI, respectively. Furthermore, dispersion and impulse 
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response in single-material fibers are considered in Sections III and IV. 
The simple but burdensome mathematics involved are placed in the 
appendices to this paper. 


Il. SOLUTION OF THE SLAB-COUPLED GUIDE 


Consider the somewhat generalized slab-coupled guide of Fig. 6(a). 
It is shown in Appendix B that if 


(t) most of the electromagnetic energy travels within the region 
of refractive index n, 

(7) the height and the width of the core are almost constants, 

(477) there are no turning points within the core (0 < x < Wax) 

or, in other words, exponential decay of the field components in 

the region of refractive index n occurs only in the slabs, and 


Ni; 
(2v) n> 4 Na, (1) 
n3 


then the four-dielectric guide with somewhat arbitrarily shaped core 
in Fig. 6(a) is equivalent to the single-dielectric guide with rectangular 
core in Fig. 6(c). Surrounding the dielectric of refractive index n, 
there is a material into which there is no field penetration and the 
x and y field components of the guided modes vanish at the interface. 
The dimensions of this equivalent guide are, according to eqs. (83), 
(84), and (85), 


T a i(1 ate C1), (2) 
W = w(l is Cw), (3) 
H = h(1 + en), (4) 


in which the quantities ¢c:, cw, and c,, which are small compared to 
unity, are from (73), (74), (79), (86) to (89), (100), and (101). 


: ( 4 ~ ) for modes polarized along x 
nee 12 12 6) 
1 3 ] 
ia ( 7 + 7. ) for modes polarized along y 
2 
2h = for modes polarized along x 
WU3 N 


= (6) 


2h for modes polarized along y 
WV3 
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Ea + y, tanh ( ay + tanh7 2 ) 
Vy a h U3 
for modes polarized along x 
Ch = : (7) 
NY Nye 


2 
+ —* tanh ( 9 + tanh! cae ) 


ny, nv, nev; 


for modes polarized along y, 





where 
01,2,3 = khvn? — Ni23 Ry (8) 
Wavas 
i= ae s, (9) 
Wmax 
ON Hie (10) 


and Amax and Wmax are the maximum height and width of the portion 
of the core with refractive index n in Fig. 6(a) and s is its cross-sectional 
area. All these expressions are valid for 


Ds = ae 
= — tan“ a for modes polarized along x 
2 ny — N13 
v> (11) 


2 2 2 
T ne [rr—n : 
i tan“ a ene for modes polarized along y, 


vy = kiNn? — ni. (12) 


Four parameters, then, », T, W, and H, determine the guide in 
Fig. 6(c) and we proceed to characterize its transmission properties. 

The guided modes are hybrid; however, the longitudinal field com- 
ponents (along z) are small compared to the transverse ones; there- 
fore, the modes are almost transverse electromagnetic (TEM). Within 
the core, these transverse field components vary sinusoidally along 
x and y. Within the slab, the field components also vary sinusoidally 
along y but decay exponentially away from the core. All these com- 
ponents vanish at the edge of the guide. 

There are two families of modes, EH}, and Hj,. The first family, Fig. 
7(a), is mostly polarized along y, and the main transverse field com- 
ponents are #, and H,. Within the core, a mode has p field extrema 
(approximately p half periods) along x and q field extrema along y. 


a” 


where 


T Throughout this paper numbers or letters separated by commas must be con- 
sidered one at a time. 
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Fig. 7—Two families of modes. (a) #¥, modes and (b) #7, modes. 
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The second family of modes, Fig. 7(b), is mostly polarized along x 
and the main transverse components are #, and H,. 

For both families, the axial propagation constant k, and the field 
penetration in the slabs d,,, that is the distance over which the field 
components decay in the slabs by 1/e, are according to (98) and (99), 


i tN [witrwl-() 08 


where c, taken from (97) is 
a 2 Tf 1 
‘ «WH is (4 ) 








(15) 
H 


The highest-order modes, which we will designate with indices 
p = P and q = Q, are those for which the penetration depth dpg is 
infinite. For them, eqs. (13) and (14). are reduced to 


Kaa A 
aft — “ie = on = NA. (16) 


ei G\ie wv 


and 





While for ordinary fibers, the numerical aperture (N.A.) Vn? — 2 
is an exclusive function of the refractive indices of the core n, and the 
cladding n., the N.A. of a slab-coupled guide defined in (16) is mostly 
a function of the wavelength and the slab thickness. The longer the 
wavelength and the thinner the slab, the larger the numerical aperture. 
This statement is true provided that the inequality (11) is satisfied. 

Naturally, an equivalent cladding refractive index 


Ne =4/Nr? — (sr) (18) 


is derived by equating kz min/k to ne. 

The easiest property to observe in slab-coupled guides is, probably, 
the number of spots of the highest order mode guided. That number is 
the product of P and Q which are related to each other by eq. (17). 
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Fig. 8—Waveguide dimensions for £3 mode at cutoff. 


A plot using PT/W and QT/H as coordinates and T?/WH as the 
parameter is shown in Fig. 8. Given T'/W and T/H, the parameter 
T?/WH, which selects one of these curves in Fig. 8, is also known. For 
the ordinate T/W corresponding to P = 1, the abscissa Qimax T'/H is 
determined and from it the maximum number of half periods Qmax 
of the modes H7Z,,,, in the y direction. Similarly for the abscissa T'/H 
corresponding to Q = 1, the ordinate Pmax T/W yields the maximum 
number of half periods Pmax of the modes E%%,., in the x direction. 

Example: For T/H = 0.5 and T/W = 0.1, the values Pmax = 8 
and Qmax = 1 (rounded off to the immediate lower integer) are 
obtained. 


The explicit values of Pmax and Qmax derived from (17) are 


ron Y(t] 
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and 


On ar es G7) (1 A =n) | (20) 


From this last equation, 
H 


Dinas 


Consequently, for any guided mode the slab thickness 7 is always 
smaller than the half period of the mode in the core along y. This 
justifies one of the assumptions in Appendix B. 

The guide is largely overmoded if 





ie (21) 


T 
7 <1 (22) 
and 
T 
H <1; (23) 


then, the number of modes for each polarization deduced from (19) 
and (20) results in 

a WH 
Unlike ordinary fibers, the number of modes of a slab-coupled guide 
is mostly determined by its geometry. 

To dimension the slab-coupled guide for single-mode operation, 
eq. (17) has been plotted in Fig. 9 using T/H and T/W as variables 
plus two sets of parameters P = 1, Q = 2 and P = 2, Q = 1. The 
coordinates of the first line give the dimensions of a guide with the Ej, 
and HY, modes at cutoff while those in the second line yield the guide 
dimension with the #3, and £3, modes at cutoff. The solid portions of 
both curves determine the smallest possible ratios T/W and T/H 
compatible with single-mode guidance. The discrete numbers on the 
curves indicate the ratio H/W. For a square core (H = W) we deduce 
from the figure that T7/W ~ T/H ~ 0.5. 

For the fundamental mode, the field penetration in the slabs, di1, 
is obtained from (14), making p = q = 1. With rearranged terms, 
eq. (14) reads 

W 1 2T 1 


"i G)- Ga) (a) 
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(25) 
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Fig. 9—Waveguide dimensions for Ej,” and £3,” modes at cutoff. 


and it is plotted as solid lines in Fig. 10 using T7/H and T/W as co- 
ordinates and 7'/di; as the parameter. 

In the same figure, the dotted line is a reproduction of the curve of 
Fig. 9 corresponding to the cutoff condition of the Ej? and 3 modes. 
The intersection of this curve with the others yields, then, the field 
penetration of the fundamental modes in guides designed to be at 
cutoff for the next higher order modes. 

The region between the solid curve with parameter T/dis. = 0 
(infinite field penetration in the slab) and the dotted curve delimits 
the possible choices of T7/H and T/W for single-mode waveguides. 
The region within the dotted curve corresponds to multimode wave- 
guides. 

Let us consider now the attenuation of the fundamental mode due 
to radiation induced by the curvature of the guide’s axis. If the guide 
axis 1s bent in the plane of the slab along a constant radius of curvature 
R, the attenuation of the fundamental mode in a 90° bend’ is pro- 
portional to 

2 
Rexp| — saa aE | (26) 
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and it is negligibly small if? 
2 
R=24 ( =) d3,. (27) 


The tolerable radius of curvature decreases rapidly with dy. 

Shorter radii of curvature can be negotiated in the plane perpen- 
dicular to the slabs if, as it happens in general, the field penetrations 
from the slabs into the media of indices n; and n3 are smaller than dy). 

We can reuse Fig. 10 by substituting the parameter T'/dy, with its 


equivalent 
n2 T3 3 


deduced from the equality in (27). For single-mode waveguides, the 
shortest dy, and consequently the shortest tolerable radius of curva- 
ture is achieved for T/H ~ T/W ~ 0.5. 

The pertinent calculations for curvature-induced losses in multi- 
mode slab-coupled guides are carried on in Section III, where multi- 
mode single-material fibers are considered. 


T/W 





0 0.2 0.4 0.6 0.8 1.0 1.2 1.4 
T/H 


Fig. 10—Field penetration in slab d,, and tolerable radius of curvature & for 
fundamental mode. 
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lif. MULTIMODE SINGLE-MATERIAL FIBERS 


Single-material fibers supporting any number of modes, Fig. 1(a), 
are characterized by 


N= Ne, = 23 = im (28) 


Under these circumstances, the location of the slab with respect to 
the core in Fig. 6(a) is not taken into account by the theory presented 
in this paper and, consequently, a more general cross section of single- 
material fibers is shown in Fig. 6(d). Figure 6(c) is still its equiva- 
lent (Appendix B). 

Multimode single-material fibers satisfy not only (28) but also the 
following inequalities 


> 1. (29) 


~|S ale all> 


Therefore, parameters (2), (3), and (4) defining the guide are sub- 
stantially simplified 


T St (30) 
W=w (31) 
H=h (32) 


and are valid for all polarizations. 

According to (30), (81), and (32), the electromagnetic field is well 
confined within the guide. Using these values, the numerical aperture 
(16), the equivalent external refractive index (18), the number of 
modes for both polarizations (24), and the propagation constant for 
each mode (13) are 


IN 
NA.=5«1, (33) 
vr \2 1f/r~\2 
= 2> aed, ~ reece (uaa 
eae (3) =-[2 Teak el 
rs 
MS (35) 
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and 
2 
ke = 4/k?n? — (=) ae (3). (36) 


where S is the core cross-sectional area. 

Unlike ordinary guides the number of modes N is independent of 
the free-space wavelength \. In other words, by keeping 2 fixed, the 
scale of the guide’s cross section could be changed without vary- 
ing the number of guided modes! How is it possible? The follow- 
ing is a plausible argument. For a given wavelength, if S is increased 
the number of guided modes in the core should increase, but simul- 
taneously the number of modes that can escape through the enlarged 
slabs is also increased. The fact that both increases compensate for 
each other can only by justified with the mathematics. 

Let us turn now to modal dispersion. Calling LZ the length of the guide 
and c the speed of light in free space, the group delay spread between 
any mode with propagation constant k, and the fundamental one? 
(which has a propagation constant very close to that of a plane wave 
in a medium of refractive index n) is 


_Ld 


Te Se (kz — kn). (37) 
With the help of (86) 
Ln {kn 


The maximum time spread occurs for the highest order mode which 
has the smallest k, value, kn.. Then, using (34) for the value of 7., 


L n L {xr 
roe = Gn(—1)= (2): (39) 


The impulse response is similar to that of clad fibers. A short im- 
pulse feeding equally all of the guided modes arrives at the other end 
as many impulses unequally displaced in time.!° However, the power 
density of the arriving pulse is uniform over the time interval Tmax 
given in (89) and zero elsewhere. This impulse response width being 
inversely proportional to the square of the slab thickness can be 
shortened by increasing t. 

Since there is more familiarity with clad fibers than with single- 
material fibers, it is of interest to make a comparison between them, 
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assuming both guide the same number of modes N, and have the 
same modal dispersion spread 7max. For a clad fiber of radius a and 
core and cladding refractive indices n and n(1 — A), those values 





are}! 9 
N= (7 )' (40) 
and 
Tnx = And. (41) 


These two equations together with (85) and (89) are plotted in Fig. 11. 
The group of curves on the lower part corresponds to single-material 
guides, and those on the upper part correspond to the equivalent clad 
fiber. Dotted lines are for the modal dispersion spread and solid lines 
for the number of modes. The parameters are either the core diameter 
of the clad fiber or the square root of the core cross section of the single- 
material fiber normalized in both cases to the free-space wavelength. 
Example: For a dispersion spread 7 of 26 ns/km and N/n = 100, 
the single-material fiber dimensions are t¥n/d = 4 and VS/\ = 31.6, 
while those of the equivalent clad fiber are nA = 0.008 and 2a/\ = 36. 
If the multimode single-material fiber is bent, all the modes become 
somewhat lossy; however, as in ordinary clad fibers the radiation loss 
is significant only for those modes whose plane wave components ex- 
ceed the critical angle. Unlike clad guides, though, a bend in the plane 
of the slabs produces higher losses than a similar bend in the perpen- 
dicular plane. This is due to the fact that, for a given mode, the field 
penetration in the slabs is far larger than in the material of index 7. 
The single-material fiber bent in the plane of the slabs on a radius 
of curvature R has a numerical aperture N.A.’ and guides a number 
of modes” N’, both of them smaller than the N.A. (83) and the 
number of modes (85) of the straight guide. As a matter of fact, 


ne ee x E = a y | (42) 
and 
w= 55 ([1-28( >) |: (43) 
Only half of the modes remain guided if 
hy = u( SY. (44) 
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Fig. 11—Multimode single-material fiber and its equivalent clad fiber. 
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Example: For n = 1.5,t/\ = 2and w = 504, from the previous formula 
follows 
Ry = 7.2 mm, 


a small radius indeed. 


IV. SINGLE-MODE SINGLE-MATERIAL FIBERS 


In single-mode single-material fibers, Fig. 1(b), of interest for 
optical communication, all of the dimensions are large compared to 
the wavelength of operation. Therefore, eqs. (28) through (32) are 
valid and all formulas and figures of Section II related to the funda- 
mental mode in slab-coupled guides are applicable just by changing 
T, H, and W into t, h, and w, respectively. 

The dimensional requirements for single-mode operation with the 
next higher order at cutoff are determined by the solid line in Fig. 9. 
For example, it may be that for splicing purposes it is desirable to 
have a core of square cross section; then, from that figure, 


h = w = 2t. (45) 


As in the multimode case, these dimensional requirements are inde- 
pendent of the wavelength and, consequently, the cross section of 
the guide can be scaled to satisfy other demands, such as relief of 
splicing tolerances, simplicity of fabrication, etc. 

The propagation constant of the fundamental mode for both polar- 
izations (18), the field penetration in the slabs (14), and the tolerable 
radius of curvature (27) are 


1 1 
= 2y2 2 ane ae a eres 
ke ken | at aacap |: (46) 
i t= 4 1 
da Ne Bo wate (47) 
and 
24 /n\2 3 
r= (%) a (48) 
me wd +e)? 
where 
2 


Normalized values of di; and R can be found as parameters in Fig. 10 
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Table | 





10 17.0 40.0 330.0 925 
Mode at Cutoff Ez Ezy y Dieta Ej,” 


for different values of t/h and t/w. Points on the dotted curve belong 
to single-mode guides with either the E7? or #3’ modes at cutoff. 

Continuing with the practical example above in which h = w = 22, 
we obtain either from (47) or from Fig. 10 


dy => 0.42¢. (50) 


With the help of (48) or Fig. 10, one can calculate a table of tolerable 
radii of curvature (Table I) for the fundamental mode in a guide 
with either the H7¥ or H3¥ modes at cutoff and assuming n = 1.5 and 
A = 1 pm. 

Shorter radii of curvature are achieved for smaller ¢ and t/h ratios 
if the guide is designed for the E3/ modes at cutoff (see Table I, first 
three columns). As seen from the last column, guides designed for Hi 
modes at cutoff have longer tolerable radii of curvature. The guidance 
is not as tight and consequently less desirable. 

Let us turn to dispersion. Knowing k, (46) it is possible to calculate 
the dispersion (L/c)(dk./dk) in a guide of length L. However, a more 
interesting result is the guide response to a Gaussian input pulse of 1/e 
width TJ. Following standard techniques to calculate responses 
through linear devices, one finds the output to be close to another 
Gaussian whose 1/e width is 


OL ak, |? 
T = Tei +| om Se | (51) 


The second derivative is to be calculated at the wave number k, of the 
carrier and c is the free-space speed of light. 

For a given length of fiber L’, the input pulse width T> that mini- 
mizes the output pulse width (T’ = v2T7.) is related to L’ by 


22 
peo sw ns, (52) 


> (Be) 
Ok? J kxk, 
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Assuming (46) to be applicable and ci < 1, 


272 2apy2 
a me (53) 
Example: For 
n = 1.5, 
T., = 10 ps, 
h = w = 20um, 
and 
A= Lym, 
then, 
L’ = 34 km. 


As expected, the waveguide dispersion of the fundamental mode in a 
single-material fiber is very small and material dispersion may be more 
significant. 
V. RIB WAVEGUIDES 
These slab-coupled guides, Figs. 2 and 6(b), are characterized by 
Ne = n3 = 1, (54) 
eI, (55) 


and h slightly larger than t. Substituting (54) and (55) in (2), (3), and 
(4), the dimensions of the equivalent guides in Fig. 6(c) are 


t (1 + , ) for #5, modes 
f= (56) 
t (3 “bh ae ) for HY, modes, 
nv 


W=w for EZ! modes, (57) 
and 
h (1 + é) for E>, modes 
H = (58) 
nit f f d 
h Lay or EY, modes, 
where 
= ktvn? — n?. (59) 


Simplified by (54) and (55), eq. (11) says that expressions (56) and 
(58) are valid provided 
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In? — ni 
—s yee th for ESaq modes 
ee 


v> (60) 


ao + 4 (a for HY, modes. 


Using the values 7, W, and H given by (56), (57) and (58) in 
previous equations and figures, the following results can be ascertained: 


nla 


(z) Propagation constants of different modes and polarizations 
(13), 
(7c) Maximum number of half periods Pmax and Qmax, in the highest 
order modes /¥¥,,,, and H2%,,, [(19) and (20) or Fig. 8], 
(i772) Dimensions of the guide for single-mode operation with the 
next higher order mode at cutoff [(17) or Fig. 9], 
(iv) Field penetration in slabs d,; for the fundamental modes [ (25) 
or Fig. 10], 
(v) Tolerable radii of curvature in the plane of the slabs for the 
fundamental mode [ (27) or Fig. 10]. 


VI. STRIP-COUPLED GUIDE 
This guide, Figs. 3 and 6(b), is characterized by 


nz = 1, (61) 
a (62) 
ee (63) 
and 
| a (64) 


Substituting (61) through (64) in (2), (3), and (4), the dimensions 
of the equivalent guide in Fig. 6(c) are 


t (1 + x) for Ej, modes 

T = (65) 
t “( +i) for E¥, modes, 

W = for #7” modes, _ (66) 
t ita tat “ta nh “we ) for EZ, modes 

H = (67) 
t (1+ ue + ta nh ne ) for H¥, modes, 
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where 


U1,2 = kiNn? — nio. (68) 


These expressions are valid only if, according to (11) simplified by 


(61) through (64), 
2 m2 
= = for EZ, modes 
11> i 5 5 
Ie — a 7 for E>, modes. 
“/ Bess 


It should be noticed that H cannot be made much larger than T just 
by increasing hy because the field decays almost exponentially along 
y in the material of index nz, Fig. 6(b). As a matter of fact, the maxi- 
mum value H is 


| 
pola 


(69) 





t (1 + 4 + =) for EZ, modes 
1 2 
hig : F (70) 
t{i1+ Eli Ree ele for HY, modes 
nv, n'y? J us , 


As in the rib waveguide the values of T, W, and H in (65), (66), 
and (67) can be entered in previous formulas and figures to find 
propagation constants of modes polarized along either x or y (13); 
number of guided modes [ (19) and (20) or Fig. 8]; dimensions of guide 
for fundamental mode operation with the next higher order at cutoff 
[ (17) or Fig. 9]; field penetration in slabs for the fundamental modes 
[ (25) or Fig. 10]; tolerable radius of curvature for the fundamental 
mode [(27) or Fig. 10]. 


ACKNOWLEDGMENT 


The author is grateful to Mmes. W. Mammel and D. Vitello for 
their computational contributions and to Messrs. 8. E. Miller, J. 
Arnaud, and D. Marcuse for stimulating and demanding discussions. 


APPENDIX A 
Approximate solution of the slab 


Consider the two-layer slab in Fig. 12(a). Propagating along z, the 
field components are independent of x and the characteristic equa- 
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ng 


=> 
nN 


n2 


BD i 


«<——-—-—-- ><> 


ny 





(a) 


Fig. 12—(a) Two-layered slab. (b) Equivalent slab. (c) Field distributions (Ez 
and H, or £, and H,) in original slab (solid line) and in equivalent slab (dotted line). 


tion!’ is 

Kw 
mq = Yt tan? ———— 
g=¥ — 





Py ec K E 
+ tan | Narr tanh E vv3 — Y + tanh EN; = I} » (71) 


where 
Y= k,h (72) 


is the electrical height of the slab of index n; g is the number of field 
extrema within the slab; 





Ni2,3 i 
Ge je 2 for polarization along y (73) 
1 for polarization along z, 
V1,2,3 = khvn? — ni23. (74) 
Assuming 
ny 
n> {i (75) 
nN 
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the field components vary sinusoidally along y in the medium of index 
n and exponentially in the others. 

Real solutions of (71), which correspond to guided modes, exist for 
values of v; larger than 


_ 1 i Jn? — n3 
tin = 3 ( -5) tan |X: er 


ee 
stan | 92 Vi — of + tanh F a malt (76) 


K, ni = ne 








The simple asymptotic solution of (71) for 





v1 > Umin (77) 
is 
_ _ 7 
vy aad 1 + Ch’ (78) 
where 
_ Ki , Ke ho _, Ks v2 
cy = St + Stank ( Poe + tanh ea): (79) 


The asymptotic solution (78) and the exact solution have been 
plotted in Figs. 13(a), (b), and (c) for several cases of interest. 

Since the percentile errors are small, even close to the cutoff values 
of v1, expression (78) will be used throughout the paper. 

Now we proceed to find a much simpler slab [Fig. 12(b)] that is 
equivalent to that of Fig. 12(a) in the sense that both have the same 
propagation constants 


_ mg 
n> HP ep 0) 
and 
ke = Vien — Ke. (81) 


That is indeed the case if the slab in Fig. 12(b) has refractive index n, 
height 

H =h(1 + 6), (82) 
and is surrounded by a hypothetical dielectric* such that all the 
transverse field components (components along x and y) vanish at the 


interfaces. This dielectric plays only the role of confining the electro- 
magnetic field within the slab. 


*This hypothetical dielectric has infinite conductivity if H, ~ 0 or infinite perme- 
ability if H, ¥ 0. 
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Fig. 13—Exact and approximate width of dielectric slabs. (a) Symmetric slab. 
(b) ees slab (polarization along y). (c) Asymmetric slab (polarization 
along x). 
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A geometrical interpretation of the equivalence of the slabs can be 
gained from the field intensity distributions shown in Fig. 12(c). 


APPENDIX B 
Solution of the slab-coupled guide 


The exact solution of Maxwell’s equations for the dielectric guide 
whose cross section is shown in Fig. 6(b) is very difficult because the 
boundaries are not analytical. However, a good quantitative insight 
can be gained if, as in Ref. 14, good guidance is assumed, that is, if 
most of the electromagnetic energy is contained within the guide. 
Then the field in the shaded areas, Fig. 6(b), can be ignored and the 
slab solution of Appendix A can be applied independently to each of 
the finite slabs of widths h, w, and ¢ that make the guide. Thus, another 
dielectric guide, Fig. 6(c), is derived which is equivalent to that in 
Fig. 6(b) in the sense of having the same axial propagation constant k, 
and the same field penetration in the slabs of thicknesses ¢ and 7’. 
Unlike the original guide that has four dielectrics, the equivalent one 
has a single dielectric of index n and is surrounded by a hypothetical 
material that forces the transverse field components (along x and y) to 
be negligibly small on the boundaries and confines the electromagnetic 
energy within the guide. The dimensions of this equivalent guide are 
related to the original one by the following expressions derived from 
(82), 


H =h(1 +n), (88) 
W =w(l + cy), (84) 
and 
T=i+ce,), (85) 
where 
a (86) 
Uw 
_hf{ Ki , Ks 
eG a) ea 
ns f larizati I 
ox " or polarization along z, (88) 
1 for polarization along y, 
and 
i= kwvn? — n3, (89) 


and, because of the choice of symbols, c, coincides with (79). 
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To solve the boundary value problem of Fig. 6(c) we make further 
assumptions. One is that the slabs do not perturb the sinusoidal dis- 
tribution of field in the core and vice versa. Another assumption, 
justified qualitatively in the text, is that only the fundamental mode 
of the slabs of thickness 7 contribute significantly to determine the 
propagation constants of the modes of the guide. Then, the character- 
istic equations of the core and slabs are 


we =(B) + (HF) +H (90) 
ken? = —k?, + (F) + k2, (91) 


where p and q are the number of half periods along x and y, kz. is the 
propagation constant in the x direction within the slabs, and W, is 
the equivalent width of the core. W, is somewhat different from W 
because of the field penetration from the core into the slabs. 

To calculate W., we imagine the core divided by the dotted line in 
two regions, a and b. In region a the width is W. In region b the elec- 
trical width ¢ = k,W is given by the slab equation" 





mp=o-+2tan . (92) 


“ 
KkzsW 


With the value of k,, deduced from (90) and (91) and considering that 
W. = W, eq. (92) becomes 











m=o+2 tan aa vqii"\3 (93) 
(4) - (Hr) ~ oor 
and its asymptotic solution is 
a Ey eS EG 2) 
Eo: awW _ (ay 
H 


Following the procedure described in Appendix A, the equivalent 
width of the region 6 of the core results in 


2T 1 
Wr=W Lt on (95) 


=) 
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We will assume the equivalent width of the core W, to be a linearly 
weighted average of W and W,. Therefore, 


Wi ee Tae Wall 





oe H 
or 
W.= W{il+e), (96) 
in which 
2 Tf? 1 
“aWH [ (qty en) 
“FF 


From (90), (91), and (96), we derive the explicit values of the prop- 
agation constant of a mode and the penetration depth d,, in the slab 
over which the field decays by 1/e: 


ke = aftnt — | waeay | — (F) (98) 
and 
tu g-- Fy -|werag|-(a) © 


These results apply not only to the guides in Figs. 6(b) and 6(c) 
but also to the somewhat more general guide in Fig. 6(a) provided 
that (7) the curved edges depart only slightly from those in Fig. 6(b), 
(iz) no exponential decay of the field or, equivalently, no turning point 
is introduced by the wall deformations, and (777) h and w are chosen 
to be 








hinax 
h= ee S (100) 
and 
w= ral 8, (101) 


where hmax and Wmax are the maximum height and width of the core 
portion of index 7 in Fig. 6(a) and s, its cross-sectional area. 

If exponential decay of the field were introduced by the wall de- 
formation, the simple expression (98) developed for k, would not be 
applicable. 
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The choice of h and w in (100) and (101) are derived from the WKBJ 
method! or from the almost obvious demands: 





hw =s 
and 
h = Imax 
WwW Wmax 


which mean that both portions of core with index n in Figs. 6(a) and 
6(b) have equal surface and equal aspect ratio. 

After so many approximations one wonders about the percentile 
errors in the final results (98) and (99). We can have some impression 
of the precision achieved by checking (98) against the more exact 
results developed elsewhere!! in order to dimension an optical fiber of 
circular cross section at cutoff for the second mode, assuming small 
difference of refractive indexes between core and cladding. 

Calling a the radius of the fiber core, the pertinent values for (99) 
are 


ny = he = Nz, 


= 0, 
h = w= Vra from (100) and (101), 
2 
Ch = Cy = = from (79) and (86), 
a ag (79) (36) 
2 
W == Vara ( + =) from (83) and (84), 
Co = from (97), 
and 
dpq = 9, 
where 
V = kawvn? — nj. (102) 


Substituting the values of dyq, cq, H, and W in (99), one obtains for 
p = 1 and q = 2, or for p = 2 andg = I, 


V = 2.83, 


while the exact result is 2.4. The error of 18 percent is small indeed 
considering that at cutoff the assumption of negligible field outside 
of the guide is crudely violated. 
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Transverse Coupling in Fiber Optics 
Part Il: Coupling to Mode Sinks 


By J. A. ARNAUD 
(Manuscript received October 12, 1973) 


The number of modes that can propagate without radiation loss in 
oversized waveguides ts sharply reduced if the waveguide is coupled to a 
structure supporting radiation modes, the loss mechanism being analogous 
to Cerenkov radiation. The coupling formula derived in Part I! is used 
to evaluate the loss for a specific configuration: a reactive surface (e.g., 
a thin dielectric slab) acting as a waveguide, coupled to a semi-infinite 
dielectric acting as a mode sink. The method consists in first assuming 
that the substrate is finite in size and lossy and adding the losses associated 
with each substrate mode. The substrate dimensions are subsequently 
made infinite and the dissipation loss is made to vanish. The expression 
obtained for the radiation loss coincides with an expression obtained by 
solving the boundary value problem. The method is then applied to the 
problem of mode selection for dielectric rods coupled to dielectric slabs, 
which ts of particular importance for optical communications and inte- 
grated optics. A 2-dB/m radiation loss is calculated for the first higher 
order mode when the rod radius is 10 wn, X = 1 pm, n = 1.41, and the 
rod-to-slab spacing ts 0.15 wm. 


I. INTRODUCTION 


An expression for the coupling between lossy single-mode open wave- 
guides was derived in Part I.! We now investigate the coupling of a 
waveguide with finite cross section with a waveguide with infinite 
cross section (called a substrate), the latter supporting radiation modes. 
Radiation losses are suffered whenever the propagation constant h of 
the guided mode is smaller than the highest propagation constant h, 
of the radiation modes carried by the substrate. Radiation then takes 
place at the Cerenkov angle @ = cos" (h/h;). By properly choosing 
the dimensions and permittivities of the waveguide and those of the 
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substrate, it is possible to reduce the number of modes that can 
propagate without attenuation (in the absence of dissipation and 
scattering losses). This arrangement is of great practical importance 
because optical fibers are usually highly overmoded to facilitate fabrica- 
tion and splicing.” (For a coherent source, it is important to reduce the 
number of modes because different modes usually have different 
group velocities. If a short optical pulse is sent through the fiber, mode 
conversion takes place because of the imperfections of the fiber; this 
causes the pulse to spread in time.) The mode selection mechanism 
just described is also of practical importance in the microwave range 
for oversized waveguides such as oversized microstrips on dielectric 
substrates and oversized dielectric strips.* Multimoding in traveling 
wave tubes can also be avoided with the help of mode sinks. 

We investigate the loss mechanism for two specific configurations. 
First, a reactive surface acting as a waveguide coupled to a semi- 
infinite dielectric acting as a mode sink. We show that, by adding the 
losses associated with each substrate mode, an expression for the total 
loss is obtained that coincides with an expression obtained by solving 
the boundary value problem. Then the method is applied to the prob- 
lem of a dielectric rod coupled to a dielectric slab.” The case of dielectric 
rods coupled to dielectric cylinders supporting whispering gallery 
modes and acting as mode sinks? will be discussed in another paper. 


Il. RADIATION LOSSES IN SUBSTRATES—GENERAL FORMULA 


To evaluate the radiation losses, let us first assume that the trans- 
verse dimensions of the substrate are finite, and let hsz = Aer + the; 
be the propagation constant of a trapped mode in the substrate, with 
hs, real and h,; real positive (the subscript s stands for ‘‘substrate’’).' 
If h, denotes the propagation constant of a trapped mode of the wave- 
guide in the absence of the substrate, the propagation constant h of 
the coupled wave is, from eq. (6a) in Part I, 


h=hot+ 5 (Nez is ho) = La (hse _ ho)? *E Cry (1) 


where C? = c,c,/P.P, denotes the coupling coefficient defined in Part 
I. The minus sign before the square root has been selected because it 


“In the microwave range, there are no compelling reasons for using dielectric 
waveguides that are large compared with the wavelength in all dimensions, but we 
may want to use strips (either metallic or dielectric) whose widths exceed one 
wavelength for improved accuracy. 

t The dependence of the field on time (¢) and on the axial coordinate (z) is denoted 
exp [7(hz — wt) ]. This term is henceforth omitted. 
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corresponds to the mode whose field is concentrated in the waveguide 
cross section rather than in the substrate (that is, we require h = h, 
when C? = 0). 


Let us now assume that h, is real (lossless waveguide) and that 
hei > C. (2) 


Using this condition, eq. (2), we can expand the r.h.s. of eq. (1) in 
power series of C? and keep only the first two terms in the expansion. 
The loss is given by the imaginary part h; of h. Because the imaginary 
part of C? can be neglected in the case that we consider, we have 


h; ww) C*heil. (Aer — ho)? + hee (3) 


The total loss £& experienced by the waveguide is now obtained by 
summing over the various modes of the substrate: 


£= y Cahsil. sve — lis)? “i hers, (4) 


where the subscript a refers to the substrate modes. We have assumed, 
for simplicity, that h,; does not depend on a. It is shown in the next 
section for a simple configuration that in the limit of dense substrate 
modes eq. (4) is in agreement, with an exact result, obtained from a 
boundary value method. 

If we let the cross-section area S of the substrate tend to infinity, 
the substrate modes become denser and denser, and the summation 
in eq. (4) can be replaced by an integral 

& = lim DY C2haiL (bara — ho)? + hx 


S?o a 
- / C (her) Rail (er — Bo)? + 2 }dha, (5) 


where we have defined a coupling density @ by 
C(her)dher = lim D> C2, 
So «a 


the range of a being defined by the condition 
her < Reve < Nie + her. (6) 


This density exists because, as S— ©, the coupling coefficient C? 
decreases at least as fast as S~!, the power in the substrate being 
proportional to S if the power density is kept a constant. 

We can now let h,; tend to zero, the condition eq. (2) being pre- 
served. The second factor in the integrand of eq. (5) is sharply peaked 
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at hs, = h, and behaves as a symbolic 6-function. Thus, in the limit 
h.; > 0 we have 


£& = re(h,). (7) 


It should be noted that the subscript a in eqs. (4) to (6) stands for 
three subscripts m, n, and s, where m refers to modes in the z direction, 
n refers to modes in the y direction (we assume for simplicity that the 
substrate modes are separable in Cartesian coordinates), and s refers 
to the state of polarization (e.g., H or E modes). 


Il. COUPLING TO A SEMI-INFINITE SUBSTRATE 


Consider a reactive surface coupled to a semi-infinite dielectric 
(Fig. 1). We consider only H modes and assume that the field is 





(b) 


Fig. 1—(a) Reactive surface, with normalized susceptance a, coupled to a semi- 
infinite dielectric with permittivity « = n%e,.. For H modes, the structure is assumed 
terminated in the y direction by electric walls. Radiation takes place at the Cerenkov 


ou 6 = cos! ((k? + a*)t/kn], k = 27/d. (b) Variation of the field as a function 
of x 
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independent of the y coordinate. Except for the changes x @ y and 
1 — — 1, we use Shevchenko’s notation.‘ 

For waves propagating along the z axis, the electric field has only a 
y component that we denote H. In a region with constant ¢, EH’ obeys 
the wave equation 


PH /dz? + (wen. — WE = 0, 
H, = —h(wn.) 8, 
A, = (twp) dH /dz. (8) 


If « has a finite discontinuity, EF and dE/dx remain continuous. 
The general solution of eq. (8) for « = e, and e¢ are, respectively, 


E = Atetx® + A-e-txz, (€o) (9a) 
E, = Ajte= + Aze-z, (e) (9b) 
X? = wrote — h?, (10a) 


g? = wen, — A? = v2 + x2, 


uw? = w*(€ — €o) Mo. 10) 


The loss can be evaluated by solving the boundary value problem. 
At the reactive surface (x = —D), we have the condition (see Ref. 4) 


dE/dt +oE =0, x= —D, (11) 


where a@ is a positive real number proportional to the susceptance of the 
surface.* We assume that, in the dielectric, the wave propagates away 
from the structure, that is, 


E, = Ase, (12) 


Note that h is expected to have a small positive imaginary part ex- 
pressing the radiation loss in the dielectric. Assuming that « is real, 
that is, that the dielectric is free of dissipation losses, eq. (10b) shows 
that g has a small negative imaginary part. Thus, the wave amplitude 
grows exponentially as the distance to the structure increases. This 
solution of Maxwell’s equations is called a “leaky wave.” 4 It is not 
difficult to show that the curves of constant irradiance in the dielectric 
are straight lines making with the z axis an angle 6 = cos! (h,/kn) 
(Cerenkov angle). 


*A thin dielectric slab with permittivity « and thickness d, supported by a 
magnetic wall, is equivalent to a reactive surface with normalized susceptance 
a = w*(e — €o)uod. An equivalent configuration, obtained by symmetry with respect 
to the magnetic wall, is a thin slab of width 2d with dielectrics symmetrically located 
on both sides. Note that « has the dimension of a propagation constant. 
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From eq. (12), the boundary condition at x = 0 is 
dE/dx — igk = 0, x= 0. (13) 
From eqs. (9a), (11), and (13), we obtain the equation defining x, or h, 
(X — ta) (X + 9) = (X + ta) (X — g) exp (2iXD). (14) 


If we let aD tend to infinity, the reactive surface is uncoupled to the 
dielectric and eq. (14) reduces to X = X, = ia; that is, 


X? = X2 = wre, — 2? = —a?, (15a) 
g’ = 95 Saaz w(e = Eo) Mo a Xi: (15b) 


Equation (15a) defines the propagation constant h, of the uncoupled 
reactive surface. 
Let us now consider 


exp (27X,D) = 6 (16) 
as a small parameter and set 
X=X%+MS+--, 
g=gtnit-, uo 
in eqs. (14) and (10b). Collecting terms of first order in 6 we get 
X1 = 2a (ia — go)/ (ia + go). (18) 
From eqs. (10a) and (17) we have, to first order, 
Im(h) = — (a6/h,) Re (X1). (19) 
Thus the loss £ = Im(h) is 
& = 4a3u-hz "9g, exp (—2aD), (20a) 


or, explicitly, in terms of k, n, D, and a, 


£= Ao®[k? (n2 — 1)}°(R + a) 
x [k2(n? — 1) — a? exp (—2aD). (20b) 


If the micron is used as the unit of length, the loss in dB/km is ob- 
tained by multiplying the r.h.s. of eq. (20b) by 8.7 X 10°. 

This expression for the loss, applicable to small couplings, can be 
obtained alternatively from the equality 


aie ee a [ (¢ - JE Eds / { (B+ x H, — Ep X H+)-dS, (21) 
where (E, H) and h, denote the field and propagation constant of the 
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wave guided by the reactive surface in the absence of the dielectric 
and (E*, H+) denotes the field adjoint to (E, H) (see Part I). (E,, H,) 
and h denote the field and propagation constant in the presence of the 
dielectric. The integral in the numerator extends to the dielectric cross 
section, and the integral in the denominator extends to the whole 
cross section. Equation (21) is exact and is readily obtained from Max- 
well’s equations.* The field (E,, H,), unfortunately, is not known. It 
may differ considerably from the unperturbed field (E, H) when the 
dielectric supports modes almost synchronous with the waveguide 
mode. This is why this expression, eq. (21), is, in general, not practical 
to evaluate the coupling between waveguides, or waveguides and mode 
sinks. The configuration presently considered, however, is sufficiently 
simple to be handled on the basis of eq. (21). 
For our case, eq. (21) becomes, with the approximation h & h., 


h — ho — (wto/2h) [ “(ex En ae 7. i ° de. — (22) 
0 —D 


The unperturbed field, normalized to unity at x = —D, is 
E = exp (Xx) exp (xD). (23) 


The perturbed field is obtained by assuming as before an exp (7g2) 
dependence in the dielectric, matching H and dE/dx at the vacuum- 
dielectric interface (« = 0), and stating that E,X1 at x = —D, 
We obtain 


E, = 2(1 + g/X)71 exp (¢xD) exp (2gz), x20. (24) 


Substituting in eq. (22) and integrating, a result identical to eq. (20) 
is obtained. 

Let us now apply to the same problem the method explained in 
Section II of this paper, which consists in adding the losses associated 
with each mode of the substrate. The coupling coefficient between two 
H modes, with fields EH and E,, was given in Part I. With our present 
notation we have 


C2 = oh / : Bide) (2 7. J Bidz), (25) 


where the integrals are over the whole cross section, and H#, E, are 
defined at some point located between the two waveguides. 


*The contribution at infinity is assumed to vanish. Thus, it is implicitly assumed 
that the rate of decay of the unperturbed field exceeds the rate of growth of the 
perturbed field. This condition is always satisfied for small couplings. 
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The field H of the reactive surface alone is, as we have seen, 
E = exp (—az). (26) 
Thus, at x = 0, 


BE / i ” pide = Debep 0 OeD): (27) 
20 


Let us consider next the dielectric alone and first assume that its 
thickness L, is finite. By matching # and dH/dz at x = Oand x = Ly, 
we obtain the field at the vacuum dielectric interface, and 


+00 Lez 
E? / i E'dx EP / i Bide = Qg?uLz. (28) 
—-0 0 


Substituting eqs. (27) and (28) in eq. (25), we obtain the coupling 
coefficient 
C2 = 4aig?uh;? exp (—2aD)Lz}. (29) 


Let us now evaluate the number of modes (N dh) in the dielectric 
whose propagation constants lie between h and h + dh. Because we 
are far from cut-off, the boundary condition is almost the same as for 
a metallic waveguide, H = 0. Thus, the condition on g is 


Jn = mr/ Lz, m =1,2,--:. (30) 
Using the relation 
Gm = wren, — h, (31) 
the mode number density is, from eq. (30), 
N = hg"LZ./t. (32) 


The radiation loss is obtained from eqs. (29), (82), and (7), andh = ha, 
J = Joy 
& = tN = 4a®uhs ‘go exp (—2aD). (33) 


This result coincides with the result eq. (20) obtained by taking the 
limit of large D in the exact solution. The variation of the loss expressed 
in dB/km is given in Fig. 2 as a function of the normalized susceptance 
a of the surface, for A = 1 um, ¢/e. = 2, and D = 1.5, 1.75, and 2 um. 

For comparison, when the dielectric permittivity has the form 
e = e, + te; (the dielectric is perhaps a lossy foam) and the spacing 
D is chosen as large as consistent with a loss of 10 dB/km at a = 6.28, 
the loss experienced is shown on the same figure as a dotted line. The 
comparison clearly shows the advantage of mode sinking over dis- 
sipation for mode selection. 
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Fig. 2—Radiation loss in dB/km as a function of the normalized surface suscep- 
tance a of the waveguide for a wavelength \ = 1 um, n? = 2, and D = 1.5, 1.75, 
and 2um. The dotted line is applicable to a dissipative dielectric. 


IV. COUPLING TO PLANAR SUBSTRATES 


Let us now consider a waveguide with propagation constant hi 
coupled to a substrate that extends to infinity in the y direction, but 
has a finite thickness in the xz direction. This substrate is perhaps a 
reactive plane (e.g., a corrugated conductor) or a dielectric slab, as 
illustrated in Fig. 3. In any case, homogeneity of the substrate in the 
y, 2 plane is assumed. 

Because of the assumed homogeneity of the substrate, plane wave 
solutions 

E, (a, y, z) = E,(z) exp (thsyy + th,22), (34) 
where 
hse = f(Asy &); (35) 


exist at some angular frequency w (w is now considered a fixed param- 
eter and is omitted). 
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T 
| 





Fig. 3—Dielectric rod coupled to a dielectric slab. The rod field E is shown for the 
spurious Ho: mode, and the slab mode is Hy, (» is a continuous index in the limit 
Ly — »). Coupling takes place at @ ~ 0. 


In the discussion that follows, we consider only waveguide and sub- 
strate modes that are even in y. Assuming that f is even in h,, and 
that the slab is terminated by electric walls, even modes satisfy the 
relation 

heyly = 2nn, n=0,1,2---, (36) 
where L, denotes the width of the substrate. LZ, will be later assumed 
to tend to infinity. The density N of even modes is from eqs. (85) 
and (36) 


N = (df/dhsy)™ (Ly/2r). (37) 
If the substrate is isotropic, with wave vector hs, eq. (35) is 
Nez = fey) = (his — hey), (38) 
and the mode density is, from eq. (37), 
N = (hsz/hey) Ly/2n. (39) 
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The loss is then obtained from eqs. (7) and (89). 
= 3Lu/ f(a) ICL, (40) 


the coupling coefficient C? being evaluated from eq. (6) in Part I. 

It should be noted that, when the propagation constant of the wave- 
guide mode (h;) is just equal to the propagation constant (h,) of the 
2-dimensional substrate, hs, 1s equal to zero and the loss, according to 
eq. (40), is infinite if C?L, remains finite. (This was not the case for 
the 3-dimensional mode sinks considered in Section III because, as 
L,— ©, the field at the surface of the dielectric tends sufficiently 
rapidly to zero to make C?Z, vanish in the limit.) This infinity at 
hy = hs would be removed if some finite dissipation loss in the substrate 
were present. Even in the absence of dissipation losses, the radiation 
loss remains finite at hy = hs, because the perturbation method on 
which eq. (40) is based is no longer applicable. The peak in the loss 
curve predicted by eq. (40) (analogous to a sound barrier) is pro- 
nounced only for small couplings. 

Our general result, eq. (40), is now applied to a dielectric rod coupled 
to a dielectric slab. The thickness and permittivity of the slab can 
always be chosen in such a way that only the fundamental mode of the 
rod propagates without radiation loss. The calculation of the loss of 
higher-order modes is carried out for the case where the rod diameter 
and the slab thickness are very large compared with the wavelength; 
that is, when the rod is highly multimoded in the absence of coupling. 

Approximate expressions for the modes and propagation constant 
in the slab and the rod are given in the next subsections. 


4.1 Modes of the slab 


Let us consider first the modes in the dielectric slab. If the thickness 
2d of the slab is very large (more precisely, if w?(€ — €5)u.d? > 1), the 
propagation constant of the fundamental H, mode is approximately 
given by the condition that the field # vanishes at the boundary 


E(a, 2’) & Es. cos (gs) exp (th,2’). 


Thus, we have* 
gi = weno — h? = (x/2d)?. (41) 


*A more accurate and general expression is (see Ref. 5) gsd = m(x/2)(1 — V~) 
for H modes and g.d = m(x/2)(1 — n-?V—') for # modes, where m = 1, 2--- is the 
mode number and V = ud. These expressions show that the H; mode that we are 
considering in this section is the fundamental mode; that is, the mode that has the 
largest propagation constant. The difference Ah in propagation constants is, for 
m = 1, equal to d-!(a/2knd)?(1 — 1/n?)}. 


TRANSVERSE COUPLING IN FIBER OPTICS Il 685 


(The axial coordinate is denoted 2’ instead of z to avoid changing our 
notation when waves propagating at some angle to the z axis are con- 
sidered. The origin of the xz axis is, in this subsection, at the center 
of the slab.) The axial (2’) and transverse (x) components of the 
magnetic field are, within the slab, as we have seen before 


H, = —h, (wo) 1E, (42) 
Ay (twu.) dE/dx, (43) 


I 


and the power per unit width is approximately 
+d 
Py / EH ,dx = dh,(wu,)—E2,. (44) 
—d 


The field at the boundary is in fact not exactly equal to zero. To 
obtain its value, we use the fact that the dependence of # on zx in 
vacuum is exp (—p.%), where p? = h2 — we,u,, and the continuity of 
dE/dx. We obtain 

E(d) = (n/2d)ps* Eso. (45) 


Now let the slab have a finite width L, with electric walls at 
y = + L,/2. The modes even in y can be described as a superposition 
of two infinite slab waves whose propagation constants are such that 


hey = &2an/Ly n=0,1,2, °°. (46) 


We have, by definition, 


h, being given in eq. (41). 

The field has all its components different from zero with the exception 
of E,, which vanishes. The components £, and H, are obtained by 
adding the field of the two waves. We obtain 


Eyy = 2hezhs' cos (heyy) cos (w2/2d) Eso, (48) 
Hee = —2 (twp) hezhs '(x/2d) cos (Asyy) sin (xx/2d)Eso. (49) 


The energy flowing through the slab is obtained by multiplying P, 
given in eq. (44), by 2h..hs "Ly 


Py = Qhes(ops) ALE, (50) 


The y component of the field at the boundary (x = d) is obtained 
from eq. (45) or directly from H,, = (twy.)0K/0z: 


E,(d) _ — ps wpoH sz. (51) 
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4.2 Modes of the rod 


Let us now turn our attention to the modes of the dielectric rod. 
We assume that the radius a of the rod is much larger than the wave- 
length (a > d). 

In the limit of large radii, the propagation constant of the funda- 
mental HEy, mode is given (see the appendix) by the first root of 
Jo(goa), namely, 


Jot = (wren, — hé)ta = 2.4---, a o, (52) 


The next higher order mode of the dielectric rod is the Ho, mode.* 
In the limit of large radii, the boundary condition at r = ais Hy = 0, 
as for a round metallic pipe. The propagation constant h, is therefore 
given by 

Ji(gia) = 0, (53) 


whose first root is 
gia = (wen, — hi)ia = 3.8---, a> 0, (54) 


Within our approximation, the field of the Ho1 mode in the rod (r < a) 
has components 

Ey = Ji(gir), 
= —hy(wuo) WV 1(917), (55) 
= (twp) "gi 0(917), 


ee 
| | 


and the energy flow is 
Pes I " EyH Qardr = why (ou,)~102J3 (9,0). (56) 
0 


To obtain the field Hy at the boundary (r = a), we use the fact that 
dH/dr is continuous and that the r dependence of Hy, in vacuum is 
approximatelyt exp (— pir) where pj = hi — w*e,u.. We obtain 


E4(a) = py MwpH:. (57) 


4.3 Synchronization conditions 


For simplicity and because this is a case of practical significance, 
we assume that the rod and the slab have the same permittivity e. 


*The Ho and HE2; modes have almost the same propagation constant as the Ho 
mode for large rod radii. For small radiation losses, they can be considered indepen- 
dently of the Ho: mode (see appendix). 

+t The exact dependence of Hg on r is Ko(pir), where Ko denotes the modified 
Bessel function of the second kind. For large arguments, Ko(x) ~ (2/7x)t exp (—2) 
and Kj(z) = — (2/rxz)t exp (—x) ~ —Ko(z). 
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The fundamental HE; mode of the rod is free of radiation loss if its 
propagation constant h, given in eq. (52) is slightly larger than the 
propagation constant h, in the slab given in eq. (41). For simplicity, 
we set hs = h, or, equivalently, g; = g.; that is, 


r/2d = 2.4/a, (58a) 
or 
d = 0.65a. (58b) 


Thus the ratio of the slab thickness to rod diameter is 0.65. (In practice, 
the slab has finite dissipation losses and a finite width. Furthermore, 
it is difficult to control accurately the thickness of the slab. For these 
reasons, it might be preferable to choose the value of h, midway 
between the propagation constants of the HE,, and Ho: modes rather 
than equal to the propagation constant of the HE, mode. If the 
former condition were to hold, we would find that the slab thickness 
should be equal to half the rod diameter.) Figure 4 gives the propa- 
gation constants of the rod and the slab for n = 1.41 and arod radius 
of 10 um (A = 1 um). 

Let us now consider one of the next higher order modes of the rod, 
the Ho: mode. This mode radiates into the substrate modes that have 
the same propagation constant along the z axis (hs, = hi). Using eq. 
(54), we obtain 


weg — h2, = (3.8/a)*. (59) 
Since 
hs, 3 Ney > hi, (60) 
and h, has the value h, given in eq. (52), we have 
hsy = (3.8/a)? — (2.4/a)?, (61) 
or 
hisy = 3-0/a. (62) 


In the next subsection we evaluate the coupling coefficient between 
the Ho: mode of the rod and the substrate mode defined by eq. (62). 


4.4 Coupling coefficient 

The contour of integration for the evaluation of the coupling 
coefficient being arbitrary, it is convenient to choose this contour as 
the rod boundary, r = a. Along that contour, the Hoi mode field is a 
constant. 
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Fig. 4—Propagation constants (h) of the trapped modes of the rod and maximum 
value (h.) of the Propse von constants of the radiation modes in the slab. It is 
assumed that n = 1.41, \ = 1 um, and a = 10 wm. The modes circled are those 
whose coupling is fieeieo in this paper. 


Let ¢ denote the angle from the x axis shown in Fig. 3 and D the 
spacing between the rod and the slab. We have 


x = —D — a(l1 — cos 49), 


6 
y = asin ¢. 08) 


Because a>, the coupling takes place near the point of closest 
approach of the rod to the slab; that is, ¢é } 0. We can therefore write 


xe —D — ad¢*/2, 


(64) 
Yy ~ ag. 


The y dependence of the field slab is cos (heyy) = cos (hsya¢). How- 
ever, since, according to eq. (62), hs, is of the order of a, the argu- 
ment of the cosine function is small compared with unity in the range 
where the coupling is significant. Thus, we can neglect the dependence 
of the field of the slab on y. This approximation could be relaxed with 
little additional complication. 
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Using the above approximation, we obtain for the field of the Ho; 
mode (rod) at r = a, from eqs. (55), (56), and (57), 


A, = (twpo)giJ0 (91a), (65a) 
E,= prone, (65b) 
P = thy (wpo)107J9 (914), (65c) 


where 
gi = weno — hi = (3.8/a)?, 


Di = hi — weoto MY w(€ — €o)Mo = U’. 


For the slab we have, at r = a, from eqs. (49), (50), and (51), 
setting the arbitrary constant EL, equal to unity, hs. ~h, and taking 
into account the exp (p.z) dependence of the field below the slab 


Hy. = —2(iwp.)(r/2d) exp L—p.(D ar ag?/2) ], (67a) 


(66) 


Euy = —ps whol ez, (67b) 
P, = 2h.(wpo.)dLy, (67c) 
with 
h,hh& kn, 
PM wu = k(n — 1)}, (68) 
a/2d = 2.4/a. 


The coupling coefficient C? is c?/PP,, where 
+r 
c= af” [Byles — Bx, cos (¢)H. 19. (69) 


From eqs. (65) and (67), it is apparent that the two terms in the 
integrand in eq. (69) are equal and add up if we make the approxima- 
tion cos ¢ & 1. Thus, 


+00 
c  2aB, i Hdd. (70) 


Using eq. (67a) for H,., we have 
+00 
Hdd = —2 (dep) '(/2d) exp (—psD)(20/p.a)', (71) 
if we make use of the identity 


~foo 
J ody = (w/b), (72) 


Thus, 
¢ = 4apy*gi (two)! (4/2d)Jo(gia) exp (—psD)(27/p.a)', (78) 
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and 
C2 = (32/1) gi (r/2d)*(ueh2aL,)— exp (—2uD). (74) 


Since the mode number density is given by eq. (40), the loss 
L = 3(Nez/hey)C*Ly (75) 
is finally obtained from eqs. (74), (62), (66), and (68), 
& = 340n-(n? — 1)-?(k4a5)— exp [—2(n? — 1)*kD ]. (76) 


The loss in dB/km is obtained by multiplying the r.h.s. of eq. (76) 
by 8.7 X 10°, the wm being used as the unit of length. Thus, for 
n = 1.41 and n = 1.01 we have, respectively, 


I 


LaBikm = 1.85 X 10°F (a/d)— exp (—12.5D/)), n 
Laprkem = 675 X 10°XGn(a/A)— exp (—1.76D/\), n 


1.41, (77) 
1.01. (78) 


I 


For example, if D = 0.15 um, n = 1.41, \ = 1 wm and a = 40 um, 
we find that the radiation loss of the Ho: mode is £ = 2 dB/km. If 
D =1 yum, n = 1.01, } = 1 pm, and a = 40 um, the loss is as high 
as 1140 dB/km. The radiation loss is shown as a function of a/\ and 
D/ in Figs. 5 and 6 for a wavelength of 1 wm, and for n = 1.41 and 
1.01, respectively. The amount of loss required to prevent the power 
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Fig. 5—Radiation loss in dB/km of the rod Ho: mode in the slab as a function of 
spacing D with the rod radius a as a parameter, for trod = Nstab = 1.41. These curves 
are valid for large values of D. 
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Fig. 6—Continuation of Fig. 5 for n = 1.01. 


transferred to the Ho: mode to be transferred back to the HE;; mode 
and to cause pulse spreading depends on the fiber irregularities and is 
not accurately known. 

The above results are approximate and, to some extent, incomplete. 
In particular, the perturbation method that we used is not accurate 
when D is small. Also it would be useful to ascertain that the radia- 
tion losses of the other higher-order modes are at least equal to the 
loss calculated for the Ho1 mode. For some of these higher-order modes 
of the rod, it is necessary to take into account the higher-order modes 
of the substrate, both H# and H, and this involves some complication.® 
In spite of these limitations, our result, eq. (76), should provide pre- 
liminary information concerning the mode-selection mechanism 
afforded by 2-dimensional mode sinks. In particular, the very fast 
dependence of the loss on the rod radius (a~*>) indicates that very 
large rods cannot be used if single-mode operation is to be achieved 
in air. However, if the gap between the rod and the slab is filled up 
with a material whose permittivity is only slightly smaller than the 
rod and slab permittivities, the rod radius a and the spacing D can be 
large, as Fig. 6 suggests. 
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APPENDIX 
Limit forms of the propagation constants in optical fibers 


Two approximations can be made, applicable to low-order modes 
in highly multimoded fibers and to fibers with small transverse varia- 
tion of permittivity. A simplified presentation is given in this appendix. 

Low-order modes propagating in highly multimoded fibers cor- 
respond to waves propagating almost along the axial direction, 2. 
The propagation constant h is therefore close to kn if n denotes the 
refractive index on axis. If the fiber refractive index is a constant 
within some contour and assumes a lower value outside that contour, 
the wave near a section of the contour can be assumed plane. Because 
it is incident at grazing angles, the electric and magnetic fields tend to 
zero compared with their values in the bulk. Thus, the electric and 
magnetic fields at the boundary of a dielectric rod vanish, compared 
to their values on axis, as the transverse dimensions of the rod tend 
to infinity for a given mode number. 

For a round fiber with refractive index n and radius a, the exact 
equation defining h is, using the notation of the main text,® 


Ee " K, (us) || J, (us) i K, (ue) 
Ui (ur) UK, (ue) | uid, (u1) ~~ Ue KK, (ue) 

_ | vh (ut + us) 7? 

-| 7 SEP], oo 
the axial and azimuthal variations of the field being denoted 
exp (thz + tvd) and 


Ui = ga = (k’n? — h?)*a, 
UW. = pa = (? — k*)?a, (80) 
k 


In the limit a> ~, we tends to infinity and the second terms in the 
brackets on the l.h.s. of eq. (79) vanish [K,(x)/K,(x) — —-lifa— @ ]. 
On the r.h.s. of eq. (79), 2 can be replaced by kn. Thus, it is apparent 
that eq. (79) becomes 


Jy(ur)/Jo(u1) = + v/t, (81) 


or, equivalently, using well-known formulas involving Bessel’s func- 
tions and their derivatives :* 


Jyai(u1) = 0. (82) 


w (€ofto)?. 


*We have vJ, F 2J) = rJ,41 and (for later use) vK, + 2K) = FxKy41. 
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v —2 —-1 0 1 2 
EH eS J1=0 Jo=0 Ji =0: HE 
»><0 Be ee ge v>0 
HE: J.1 =0 Jo = J, =0 J: =0 J3 = 0 EH 
[Note: J, = 0<— J, = 0] 


For symmetry reasons, modes with opposite values of v have the same 
propagation constants. For v = 2, for instance, the propagation con- 
stants of the two sets of modes are given by the roots J; and J3. For 
y =—2, they are given by the roots J_; and J_1. However, these are 
the same because J_, = (—)’J,. Equation (82) was given by Snitzer.’ 
For the HE: (v = 1) and Ho: (v = 0) modes, the relevant solutions of 
eq. (82) are the first roots of 


Jo(ui) = 0, Uy = 2.4:-> (83a) 
and 
J1(u1) = 0, U1io = 3.8°°°. (83b) 


These are the results used in the main text. 

Because the modes Ho, Eo, and HE»: have almost the same propaga- 
tion constants (see Table I), the validity of the calculations given in 
the main text can be questioned where the mode Ho was considered 
independently of the two other modes. It is therefore important to 
evaluate the actual splitting between these three modes. For simplicity, 
we consider only the Ho; and Eo: modes. The expressions giving the 
exact propagation constants of the Ho: and Ko: modes are, setting » = 0 
in eq. (79), 


J1(u1)/uJo(ur1) = —Ki(u2)/u2Ko(us), (Ho), (84) 
and 
Ji (u1)/u (ur) = —n?2Ky(u2)/u2Ko(ur), (Eo). (85) 
Setting 
U1 = Uo + 6, 6< 1, (86) 
where 
Ji(uo) = 9, Uo = 3.8-°> (87) 
on the |.h.s. of eqs. (84) and (85) and 
Ue = k(n? — 1)%a (88) 
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on the r.h.s., we obtain for the difference Ah in propagation constant 
between the Ho; and Eo; modes 


Aha = (3.8/kna)?(1 — 1/n?)}. (89) 


Except for a numerical factor, this result is the same as for a slab (see 
Section 4.1). If a = 10 um, n = 1.41, and \} = 1 um, the beat wave- 
length 27/Ah is, from eq. (89), equal to 5 em. 

The individuality of the Ho: mode is preserved and the calculations 
given in the main text are valid if the loss £ is small over that length 
(e.g.. &£<1 dB/em for a = 10 um). In fact, this restriction on £ 
may be even less stringent than that calculated above because the de- 
generacy between the three modes may be lifted further by the pres- 
ence of the slab when the coupling is increased. 

The second approximation referred to at the beginning of this ap- 
pendix is the scalar approximation widely used in optics. If the trans- 
verse variations of the medium permittivity are small, the x and y 
components of the field satisfy approximately the scalar Helmholtz 
equation 


(0?/da? + 0?/dy?)H, + [k?n? (a, y) — WIE, = 0. (90) 


A similar equation holds for E,, which need not be written down. 
Because all quantities are bounded in eq. (90), Hz and its first 
derivatives are continuous functions of x and y. 
For the rod considered earlier, eq. (90) becomes, assuming an 


exp (iu@) dependence of E, on ¢, 
PE ,/dr? + r dE ,/dr + (k?n? — h? — 2/r)E, = 0, r<a, (91) 
@E,/dr + rdE,/dr + (2 — 2 — w/r)E,=0, r>a 


These are differential equations for Bessel functions. The bounded 
solutions of eq. (91) are 


EL, = J,(gr), g? = kn? — b?, r<a (92) 
E, = AK, (pr), p=? — k, r> a. 
Continuity of EF, and dE,/dr imposes 
J (ua)/Ky (us) = (ur/u2)S,(u1)/K, (ue), (93) 
or, using the transformation formulas given before, 
Ur pp1(ur)/ST yp (U1) = U2Ky+1(u2)/Ky (ue), (94) 


a result previously derived by Snyder® from the exact equation, eq. 
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(79). In the limit a > , eq. (94) reduces to 
Ju (us) = 0, (95) 


in agreement with eq. (82). To each value of » we must associate modes 
corresponding to the two states of polarization of the electromagnetic 
field. This is illustrated in Table I. 

The physical significance of the scalar approximation is that if, 
for instance, a linearly polarized field, solution of eq. (90), is launched 
into a fiber, this field configuration is approximately maintained over 
a certain length. Eventually, however, the polarization is transformed 
because the two electromagnetic modes have slightly different real 
propagation constants as we have seen (for a report of experimental 
observations, see Ref. 8 in which the mode u = 1 is illustrated in 
Figs. 3 and 4d) and/or different losses. The scalar approximation is 
useful to obtain approximate expressions for the propagation con- 
stants. This approximation is not applicable to the evaluation of 
radiation losses if these losses are polarization dependent. This is the 
case, for instance, if the propagation constant of the rod mode lies 
between the propagation constants of the slab E and H modes. Because 
the split between these two modes is very small, this is unlikely to 
happen unless the optical waveguide has been specially designed for 
that purpose. In that sense, the scalar approximation may be applied 
to problems of radiation losses. 
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Strip-Loaded Film Waveguide 


By V. RAMASWAMY 
(Manuscript received October 3, 1973) 


Low-loss strip-loaded guides, consisting of 7059 glass film on fused 
quartz substrate with sputtered SiO, as the loading strip, have been in- 
vestigated. The number of modes supported by the strip-loaded structures 
were determined experimentally and compared with the values predicted 
by the application of an equivalent index analysis. Agreement between 
theory and experiment ts good in the case of the smaller number of modes 
which result from small loading, with the 7059 film thickness far away 
from cutoff. 


Current interest in optical fibers and thin film devices for use in 
optical communication systems has prompted the development of 
several new guided wave structures. One of these is a single material 
(SM) fiber! representing an unclad fiber with a structural support. 
Basically, this is a planar slab waveguide structure with an increase 
in slab thickness in the central region where the guided light is con- 
centrated. The region with the increased thickness can be considered 
a strip which loads a planar slab waveguide? 

Another type of a strip-loaded structure is shown in Fig. 1(a) where 
the planar waveguide has the higher index material and a strip of 
slightly lower index material acts as the loading. Since, within the 
region of the strip, most of the energy is confined in the film, require- 
ments on the edge roughness of the strip are no longer as severe as in 
rectangular film waveguides? and therefore strip-loaded structures 
seem easier to fabricate. 

Recently, Noda et al. have demonstrated guiding in curved strip- 
loaded guides using a glass film waveguide with photoresist strip 
loading. They also analyzed the modal distribution of the structure 
by using a variational technique which, however, requires extensive 
computer calculations. 
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Fig. 1—Geometry of (a) strip-loaded film waveguide and (b) equivalent symmetri- 
cal waveguide. 


In this paper we report studies on a strip-loaded guide consisting of 
7059 glass film on quartz substrate with SiO, as the loading strip. We 
use this structure to explore the characteristics of strip-loaded guides 
in greater detail. We confirm the observations of Noda et al. on the 
guiding properties of strip-loaded structures. In addition, we use our 
low-loss structures to determine the number of modes supported by the 
strip-loaded guides and compare these results with the values predicted 
by the application of an equivalent index approach. This equivalent 
index concept is helpful in understanding the guiding characteristics 
of the relatively complex structure in a simple manner. 

Figure 1(a) shows the geometry of a strip-loaded film waveguide. 
Wave propagation effects can be studied conveniently by dividing the 
structure into two regions. Region II represents an asymmetrical 
planar waveguide with n., n,, and n, as the refractive indices of the 
cover, film, and substrate, respectively. D is the thickness of the guid- 
ing film. Region I, on the other hand, includes a strip of superstrate of 
width W, thickness t, and index n,.. 

The energy is confined in the x direction because of the high index 
film. When viewed from above, we see that the structure is symmetrical 
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about the x-axis and can be considered an equivalent symmetrical 
guide [Figure 1(b)] with the loaded section having a higher index, 
thereby providing the confinement of energy in the y direction. We 
assume, however, that this equivalent symmetrical guide is unbounded 
in the x direction. Each mode in regions I and II can be characterized 
by its own phase velocity and the corresponding effective refractive 
index,®> Ny = 61/k and Ny = Br:/k. The difference in effective index 
between the two regions is responsible for the confinement of the energy 
within the loaded section in the plane of the film and is given by 


AN = AB/k = Ny — Nu. (1) 


In order to determine the effective index Ny; in Figure 1(b), we as- 
sume W >i so that region I can be considered a planar 4-layered 
structure. We assume that n, > n., N- > No. Using Maxwell’s equations 
and matching the tangential field components at the interfaces, we can 
obtain the transcendental equation describing the propagation char- 
acteristics of the TE modes 


kD = bco + bs + Mar m=0,1,2,3---, (2) 


where ¢, and ¢,. are the phase changes on total internal reflection at 
the film boundaries given by 


@,; = tan os (3) 
a es ap ree a 
Po tan K 1 + ne? ret , (4) 
and the parameter 
Ye — Yo 
ee 5 
aaAen org (5) 
The transverse propagation constant in each layer is given by 
Yo = B — (kn.)’, (6) 
vo = B — (kn.)?, (7) 
= (king)? — 6, (8) 
Vs = B — (kn,)?, (9) 


where k = w/c = 27/d is the free-space propagation constant and 6 
is the propagation constant in the planar structure of region I. 
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Hig. 2—Dependence of effective refractive index N on film thickness D and loading 
eight t. 


The effective index N can be obtained by solving eq. (2) by means of 
a computer. As an illustration, Fig. 2 shows the behavior of the effec- 
tive index N as a function of film thickness D with the loading height ¢ 
as the parameter for a specific set of values of the layer indices. By 
letting ¢ = 0 in (4), we can obtain the propagation constant 6 of the 
planar guide in region IJ. We note the effective index N(t) for a finite 
t is always larger than the case when ¢ = 0. Therefore, the loaded 
region of the equivalent symmetrical guide [Fig. 1(b)_] has an index NV 
which is higher than that of the unloaded region by an amount AN. 
Figure 3 shows the plot of AN for the same parameters of the waveguide 
structure illustrated in Fig. 2. It is clear from Fig. 3 that, while AN 
increases with the height ¢ of the loading, it does not have to be very 
large to achieve a reasonable value (of the order of 10-7) of AN. In 
fact, the difference in AN between the cases t = 0.34 and t= © is 
indeed very small. 

The total number of modes 7 in the equivalent symmetrical guide is 
obtained from the cutoff condition® as 


p= 1+ 2" (NX) — NO)}} v1 + 2 QNAN)!, AN K1. (10) 
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By using the computed values for N (¢) and N (0) in (10), the number 
of modes in the strip-loaded structure can be determined. 

We made several strip-loaded guides using RF-sputtered 7059 film 
waveguides on glass on fused quartz substrates. The film thickness was 
above cutoff, allowing at least one propagating mode in the film 
without the loading. To avoid using photoresist in the sputtering sys- 
tem, a thin layer of SiO. was sputtered on the 7059 film first and then, 
using photolithographic techniques, the SiO. film was etched every- 
where except over a strip region using buffered HF as the etchant. Etch- 
ing was carried out in small steps to control the depth carefully. The 
loading strip width W was varied from 5 to 12.5 yu. The height of the 
loading strip was 0.1 to 0.4 u, and the values for the thickness of the 
film were chosen to be between 0.2 to 0.5 yu. 

A Gaussian He-Ne laser beam was apertured and used to excite the 
strip-loaded guide by means of a prism coupler placed directly on the 
strip (Fig. 4). Coupling to the strip-loaded guide was easier when the 
height t was small. In addition to the use of a rotating table in the 
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Fig. 3—Difference in effective refractive index between regions I and II vs film 
thickness D. 
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Fig. 4—Propagation of light beam guided in a strip-loaded guide. At the point 
where the strip is scratched, the light is radiated. 


prism coupler arrangement, provision was made to tilt the entire 
assembly in the plane perpendicular to the plane of the table. Each 
mode was excited by varying the position of the beam as well as by 
changing the angle of excitation. Where there are only few modes and 
very little mode conversion, the modes can be identified by viewing 
the far field pattern, and the number of modes can be counted by vary- 
ing the exciting conditions. 

In order to compare the measured number of modes with the predic- 
tions of eq. (10), we determine N and AN from Figs. 2 and 3. For this 
purpose, the refractive index n, and thickness D of the 7059 film was 
obtained by measuring the synchronous angles using a prism coupler.’ 
The thickness ¢ was measured using a Tally-Surf thickness measuring 
machine and the width W by viewing the structure in a microscope. 
The index of SiO, film was assumed to be the same as that of the quartz 
disc used in sputtering. By computing N(t) and N(O), the number of 
modes p was calculated and is shown in Fig. 5. 

Each measured point (A) in Fig. 5 represents one guide structure. 
We find the agreement between theory and experiment is good in the 
case of smaller number of modes which result from small loading with 
the film thickness D far away from cutoff. Since it was rather difficult 
to excite the structure with a large strip thickness using the present 
techniques, investigation of structures with large strip thickness was 
not possible. Moreover, as the number of modes increased, the higher- 
order modes could not be resolved because of mode conversions result- 
ing from imperfections. In the case of higher-order modes, the measured 
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Fig. 5—-Number of modes in the strip-loaded guide. The A points represent the 
experimental results. Note that N and AN are functions of \ to be determined from 
Figs. 2 and 3. 


losses in the structure were as high as 1.5 to 2 dB/em when the strip 
edge roughness was 6000 A and the lowest measured loss was 0.5 
dB/cm for the fundamental mode. It is also interesting to note that 
the edge roughness problem once again becomes important with the 
increased loading resulting from the increase in the energy content 
in the strip. 

A more generalized approach to these new guided wave structures 
has been developed by Marcatili® and our results are in agreement with 
his analysis. 

The author is grateful to H. W. Kogelnik for his encouragement and 
many helpful discussions; in particular, for the suggestion of the 
equivalent index approach. Thanks are also due to E. A. J. Marcatili 
for helpful discussions. The assistance of F. A. Braun and M. D. 
Divino in the fabrication of the structure is greatly appreciated. 
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Rayleigh Scattering and the Impulse 
Response of Optical Fibers 


By D. MARCUSE 
(Manuscript received September 7, 1973) 


The impulse response of multimode optical fibers is distorted because 
each mode carries the signal at a different group velocity. Mode coupling 
tends to reduce the width of the impulse response. Rayleigh scattering, 
being the most fundamental scattering process in optical fibers, serves as 
a mode-coupling mechanism. However, it also causes radiation loss. The 
penalty of a seemingly apparent improvement of the impulse response 
through Rayleigh scattering is calculated in this paper. We conclude that, 
because of the high loss penalty, Rayleigh scattering is not a suitable 
technique for pulse-width improvement. 


1. INTRODUCTION 


The term ‘Rayleigh scattering’’ describes light scattering from re- 
fractive index inhomogeneities whose linear dimensions are much 
shorter than the wavelength of light. Most of the scattered light 
escapes from the core region of the fiber and enters the cladding or the 
space outside of the fiber. Some of the scattered power goes into other 
guided modes. Rayleigh scattering thus contributes to the losses in 
the fiber and also influences the impulse response through mode 
coupling. 

Since mode coupling tends to improve the impulse response of optical 
fibers,!:? the question may be asked: How beneficial is Rayleigh 
scattering for light transmission in multimode fibers because of its 
mode-coupling ability? To answer this question we investigate the 
loss penalty that is incurred if Rayleigh scattering is assumed as the 
only mode-coupling mechanism. 

For simplicity, our study is limited to a slab waveguide model (see 
Fig. 1) assuming that there is no variation of the refractive index or 
the light field in the y direction. Ignoring coupling between guided 
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Fig. 1—Schematic of slab waveguide. The scattering centers are distributed 
randomly throughout the core and the outside medium. They are infinitely thin 
threads of slightly different refractive index extending in the y direction. 


modes traveling in opposite directions, we calculate the width of the 
impulse response and the amount of scattering losses. These calcula- 
tions allow us to establish the loss penalty. We find that the loss 
penalty for any significant pulse-width reduction caused by Rayleigh 
scattering is intolerably high. Thus, it is not feasible to improve the 
pulse dispersion of multimode fibers by intentionally implanting 
Rayleigh scatterers into the dielectric material of the fiber. However, 
improved pulse transmission is obtainable by using other carefully 
engineered mode-coupling mechanisms.’ 


ll. THE COUPLING COEFFICIENT 


The even guided TI modes of a slab waveguide consisting of a 
perfect dielectric are determined by the y component of its electric 
field.? 


E, = A cos xx ea ea (1) 
E, = A cos xd e771 l2I-4) |x| > d. (2) 
The odd guided modes are given by 
E, = A sin xd |eiimad. (3) 
By = 77 A sin cd ewtlai~ |a| >d. (4) 


The magnetic field components are obtained by differentiation: 








_ = aR, 
and 
_ i aby 


The factor exp [7(wt — 8z)] is omitted from these and all subsequent 
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field equations. The width of the core of the slab is 2d. The parameters 
x and y are defined as follows: 


x = (nik? — p?)* (7) 
and 
y = (8? — ngk*)}, (8) 
with k = wVeouo, M1 = core index, and nz = cladding index. The 
propagation constant 6 is obtained as a solution of the eigenvalue 
equations: 
Y 


tan kd = . for even modes (9) 
and 
tan cd = — 7 for odd modes. (10) 
The amplitude coefficient is related to the power P carried by the 
mode : 
2ywpoP ) 
AS t ae) 11 
( (1 + yd) ay 


In addition to guided modes, the slab with infinite cladding has radia- 
tion modes. The magnetic fields of the radiation modes follow from E, 
by means of (5) and (6). The #, component of the even radiation modes 
is! 

E, = Bcosox eld (12) 
and 


By = (He cos Eo(|e| — a +¥] lel >a (3) 


with w defined by 
o sin od 


as p cos od 


(14) 


The amplitude coefficient B is given by 


= 2p*wuoP $ 
= ( m3 (p? cos? cd + o? sin? cd) ) (15) 


The parameters o and p are defined by 


a = (nik? — 6%)* (16) 
and 
p = (ngk? — p?)?. (17) 
Similarly, for the odd radiation modes we have 
E, = Csin ot es ee (18) 
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and 





is x (= 


~ [al \ «Bp 


Phase ¢ is defined by 


)' sin Co(|2| — d) +6] Ic] >d. (19) 


p sin od 


a a cos od 


(20) 


and the amplitude factor is given as 


= 2p’wpoP . (21) 
~ \ rB(p? sin? od + o? cos? od) 


The coupling coefficient between two modes has the form®~? 


a 


Roe sf (n? — n2)E,E*de. (22) 
E, and EF, are the y components of the electric fields of two modes 
labeled v and uw. The index distribution n = n(x, z) describes the wave- 
guide with slight random fluctuations around the average value, and 
No = No(x) is the index distribution that defines the ideal slab wave- 
guide. It is no = m1 in the core and no = nz outside. The ensemble 
average of n? — nj vanishes, 


(n? — ng) = 0. (23) 


The power-coupling coefficients are obtained from the expression? ® 


L L 
ln = Ze fo def de Ku @Kn erm e 


= aS. [ ac [~ dx’ , ae [ dz’ ((n? — ng) (n'? — n¢")) 
x EE, EE ,e* Fs By)(z—-2'), (24) 


The prime indicates quantities depending on z’ and 2’. 

The purpose of this calculation is to study Rayleigh scattering. For 
this reason we may assume that the correlation of the index fluctuations 
reaches only over distances that are much smaller than the wavelength 
2r/8,. The following correlation function is used: 


((n? — ng) (n’? — no”)) = D?((n? — n§)*)8(@ — a!)b(@ — 2’), (25) 


where D is the correlation length of the index fluctuations. Substitution 
of (25) into (24) leads to 





22 ee) 
hon = Geet, D'C(n? — 18)%) [| Hy |*| Ba l*de. (26) 
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The remaining z integration (after integration over the delta function) 
over the distance L resulted in a factor L that canceled from the 
equation. 

To evaluate the remaining integral in (26) we make the following 
assumption. All modes are considered sufficiently far from cutoff so 
that the guided mode fields are very weak at the core boundary, x = d, 
and negligible outside of the core. For a guide supporting very many 
modes, this assumption is justified for most of them. Thus, the integral 
in (26) effectively extends only over the region of the core. The 
integrals are of three different types: 


d 
i= / cos? Kyx Cos? K,x dx, (27) 
—d 
d 
I, = / sin? x,x sin? x,x dz, (28) 
—d 
and 
d . 
i = i cos? K,x sin? x,v dx. (29) 
—d 


Since almost all modes have rapidly oscillating fields in x direction 
inside of the core, we approximate these integrals by 


h=h= hw, (30) 


With the help of (11) and (30) we obtain from (26) 


a ktyvyud 
8(1 + y,d)(1 + yud) 8,8, 


In the spirit of our approximation, we may assume y,d > 1 and 
8B, & mk, where nz indicates the core index. Thus, the power-coupling 
coefficient can be approximated as follows: 

ke D?<( 2. »2)2 y) 
Snid n no)”). (32) 
In this far-from-cutoff approximation, the power-coupling coefficient 
is independent of mode number. Rayleigh scattering couples with 
equal strength all of the modes. 

With the same type of approximations, we obtain from (11), (15), 

(26), and (30) the coupling coefficient between a guided mode labeled 
vy and an even radiation mode with propagation constant 8: 


pkD®((n? — 82) 
8nim |B | (p? cos? od + o? sin? cd) 


Rip D?((n? — nj)?). (31) 


h=hy = 


hy? (8) = (33) 
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Coupling to odd radiation modes leads to the same type of coupling 
coefficient, h{ (8), except that cos od and sin od are now interchanged. 
The power (scattering) loss coefficient for mode » is 


a= 2" ChB) + AB) Mo. (34) 


This expression can be justified as follows. The power-coupling co- 
efficient indicates the amount of power flowing per unit length from 
the guided mode to each individual radiation mode. The sum of the 
contributions to all radiation modes gives the total loss. Since radia- 
tion modes form a continuum, the sum becomes an integral. The 
factor 2 in front of the integral indicates the doubling of the loss 
caused by power flowing not only into forward but also into backward 
traveling radiation modes. The integral over p can be converted to 
integration over @ as follows: 


a = 2" Tr(a) + he (15 ap, (35) 


The integration includes only propagating radiation modes. The con- 
tribution of even and odd modes is very nearly the same, so that we 
use only the coupling coefficient (33) and double the factor in front 
of the integral: 


_ DA (v8 = n8)) pd 


2rny 0 6p’ cos? ad + o? sin? od (36) 


ay 
To the approximation used in this analysis, the power-radiation-loss 
coefficient of the guided modes is independent of mode number. 

An exact solution of the integral in (36) is hard to obtain. If we 
consider the fact that for large values of d the sine and cosine functions 
pass through many periods throughout the range of integration, we 
can replace the integrand by its average value over a few periods 
of the periodic functions. This average is 


eee aoe ee at 
( p” cos? od + o? sin’ od i o (37) 
It now remains to solve the integral: 
nok dg = nek dg = . Ne 
[ ao i car pee ial arcsin ~~ , (38) 


In most cases of practical interest the ratio n2/n1 is very close to unity 
so that we can approximate the integral by 7/2. We thus obtain the 
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following equation for the radiation power loss coefficient 


_ k8D2((n? — n8)?) 
4n, 





= 2nikhd. (39) 


The last part of the equation follows from (32). 


Ill. CALCULATION OF IMPULSE RESPONSE 
Pulse propagation in optical fibers can be described by the following 
equation for the average power?” 


Oy: 5 TOF pws ie vy _ 
oe ae Pee ae ED ou 


This system of coupled power equations holds only for modes traveling 
in the same direction. Rayleigh scattering scatters power in forward 
as well as backward directions; however, we must ignore the back- 
ward scattered power flowing into guided modes. Physically, it appears 
that this approximation should pose no difficulty, since only those 
modes that travel in near synchronism have a chance to interact 
thoroughly. Backward scattered power travels away from the pulse 
that created it; thus, it cannot alter the shape of the impulse response 
except, perhaps, by repeated reflections. Backward scattered power 
contributes mainly to the scattering losses. We have taken backward 
scattering into radiation modes into account, but the additional loss 
caused by backward scattering into guided modes contributes far less 
loss and is ignored in our treatment. Thus, we recognize that the ap- 
proximation may lead to a slight underestimation of the total scatter- 
ing loss. 
To solve (40) we use the trial solution 


P, = A ,e~7ete [t(j 2/0)) | (41) 
Substitution into (40) leads to 


aieay (% 4.) /Ja-o+ wat io(t-™)]. (42) 


We used the fact that the loss coefficients and the coupling coefficients 
are independent of the mode number. The quantity N is the total 
number of modes. 

We obtain the group velocity of the modes from an approximation of 
the propagation constant. Using® 


(43) 


Kyd FY vp 


T 
9? 


RAYLEIGH SCATTERING 711 


we obtain from (7) 


BY Ea _ (5) |: (44) 


The inverse group velocity of mode » is 


1 dB _ 1d _ nik 


de edk ~ eB o) 
Using nk > vx/2d we obtain approximately 
+ w+ a), (46) 
with 
rw 
C= Sana. a0 


The solution of the equation system (42) is accomplished with ease, 
since we realize that the sum term in the numerator is independent of 
the mode label. Thus, the coefficients A, must be of the form 
A, = ——___o __.. (48) 
a— o+Nh+ io" Ge 


Substitution of (48) into (42) leads to an eigenvalue equation for the 
determination of c: 


N 
hy ——_*+ > 21. (49) 
“Nao + NA+ tw > Ge 


The sum can be approximated by the integral 


[- dx = 1 
0 


4 
a —o + Nh + iw™ Ga [io Ga — 6 + NA) | 


4 
iw “* GN? (60) 
X arctan eg NA 


Thus, we obtain from (49) and (50) the eigenvalue equation 


» ny t : 
Ww Ce G N 2 1 Py } 
= a . Md £3 : 
nme Nh = tan | h (io c Gia g+ Nh)) | (51) 
Fortunately, we need only the lowest-order eigenvalue since it has the 
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significance of the steady-state loss of the system of coupled modes 
and also determines the shape of the impulse-response function.?:’ 
The solution of (51) is accomplished by using the fact that the lowest- 
order eigenvalue must be close to the loss coefficient a. Thus, we set 


g=a+7. (52) 


Next, we expand the tangent function in series and solve for 7. In 
this way we obtain the approximate solution 


4 aN 
c=ata (Ge) a ot + to GN, (53) 


For our purposes the coefficient p of w? is of most importance. We 
obtain the general pulse shape by substituting (53) into (41) and 
integrating over w from —o to «. Neglecting the w dependence of 
A,, we find a Gaussian-shaped pulse whose width is?’ 


8 Ni Ni 
At = 4VpL = = — G-— VL. 54 
p ie WE (54) 
The width of the signal in the absence of mode coupling is 
at = (= - ©) = Dome. (55) 
UN v1 Cc 


The relative improvement of the width of the steady-state pulse in 
the presence of mode coupling is expressed by the factor?’ 


At 8 





R= =>] =: 56 

AT V45NhL on 

Using (39) and (56), we define the loss penalty by the expression?” 
Ral = 2.8 mie (57) 


The number of modes is obtained from (44) with the help of the cutoff 
condition 8 = nek for v = N; thus, 


aoe vn? — ne. (58) 


T 


The expression for the loss penalty thus assumes the form 


4.4 
ne 
1 a 


Rial = aoe (59) 
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Iv. DISCUSSION 


We can now answer the question that was asked in the introduction: 
Is Rayleigh scattering significantly beneficial because of its ability to 
shorten the width of the impulse response? Let us assume that we 
have a slab waveguide with a core-to-cladding index ratio of 11/n2 
= 1.01. From (59) we obtain in this case 


RaL = 31.2 = 135 dB. (60) 


We may now ask how much loss is associated with a relative decrease 
of the width of the impulse response by a factor 2, or R = 0.5. We see 
from (60) that the amount of scattering loss associated with this 
‘Improvement”’ is 

al, = 540 dB. (61) 


This shows that if we are hoping for a reduction in the width of the 
impulse response with the help of Rayleigh scattering, we have to pay 
an intolerably high price in added loss. Since Rayleigh scattering 
losses are known to be quite small, (59) indicates that this mechanism 
does not help to reduce the width of the impulse response under 
ordinary conditions. 

We are thus forced to consider Rayleigh scattering as detrimental 
to light transmission in optical fibers. Fortunately, it is a small effect 
that does not provide prohibitively high losses at visible or infrared 
wavelength. 

It is easy to understand why Rayleigh scattering is not more effec- 
tive in reducing the width of the impulse response. It has been shown 
that a very carefully shaped power spectrum of the function describing 
fiber irregularities is required to reduce the loss penalty for pulse 
width reduction.? Rayleigh scattering is particularly poorly suited for 
this purpose since its power spectrum is flat. Only a very small frac- 
tion of the total amount of scattering is used for mode mixing, most 
of it is used for light scattering into radiation modes leading to scatter- 
ing losses. 

Our calculation was based on a slab waveguide model. However, 
the result is expected to be representative of round optical fibers. 
Experience has shown that estimates of the performance of round 
fibers can be obtained from scattering data calculated on the basis of 
a slab waveguide model. 
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Coupling of Nearly Degenerate Modes in 
Parallel Asymmetric Dielectric Waveguides 


By L. O. WILSON and F. K. REINHART 
(Manuscript received May 15, 1973) 


The coupling of modes in two parallel dielectric waveguides is studied. 
The indiidual waveguides are assumed to be asymmetric and unlike each 
other. If the individual waveguides support modes with nearly equal 
propagation constants B. and Bs = Bo + 2A, then the double waveguide 
system will support two new modes with propagation constants B_=B2—6 
and Bs, = Bat 6. The shift 5 is related to A and to the shift 6 which would 
occur tf the original modes were degenerate; 6 is expressed in terms of the 
parameters describing the asymmetric double waveguide system. The field 
distributions of the new modes are approximately even and odd combina- 
tions of those of the original modes in the isolated waveguides; the relative 
amplitudes with which they are combined depend upon the amount of 
mismatching A. As the modes travel down the waveguide system, they 
partially cancel and add, thus transferring power. A power transfer ratio 
F is defined and ts shown to decrease rapidly as A/6 increases. The beat 
length L depends upon both 6 and A/6; it also decreases as A/6 increases. 
A numerical example is given to illustrate the effects of mismatching and 
to demonstrate the feasibility of constructing a mode-coupling device. 
Possibilities of tuning the device to reduce mismatching are discussed. 


I. INTRODUCTION 


Coupling of degenerate modes of parallel optical waveguides has 
been discussed by Kapany! and, to a greater extent, by Marcuse.’? 
Such coupling is of particular interest in the field of fiber optics, since 
it may cause undesirable crosstalk between adjacent optical fibers used 
for light transmission. Marcuse? has applied the theory of degenerate 
mode coupling to the problem of crosstalk between cladded optical 
fibers embedded in a lossy medium and between cladded dielectric 
slab waveguides. The fabrication of devices which would actually 
take advantage of mode coupling, such as for light switching, modula- 
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tion, or power transferral,’ is fraught with practical difficulties, since 
the specification of physical parameters must necessarily be stringent. 
These difficulties require us to view the theory of optical waveguide 
coupling from a new vantage point. 

Let us first sketch briefly what is known. If two optical waveguides 
each have a mode with the same propagation constant 8, then when 
the two waveguides are placed parallel to each other, the double wave- 
guide system supports two new modes whose propagation constants 
are By = 8 + 6 and B_ = 6 — 6. These two modes are approximately 
symmetric and anti-symmetric combinations of the original modes in 
the isolated waveguides. The shift in propagation constant, 6, is 
related to the coupling coefficients involved in a description of the 
modes by means of general coupled line equations. It can also be ex- 
pressed via a perturbation treatment of Maxwell’s equations. Since 
the superimposed modes travel down the double waveguide system at 
different phase velocities, they alternately add and cancel. If the wave- 
guides are lossless, power is transferred back and forth over a beat 
length L = 7/(26). On the other hand, Marcuse shows that, if the 
waveguides are lossy, they tend to equalize the power they carry, 
provided the modes travel far enough. A lossy external medium also 
causes mode loss. Marcuse further states that only degenerate modes 
exchange a significant amount of power if their coupling mechanism 
is independent of length. 

With this abbreviated version of the present theory in mind, we see 
several criteria which a mode coupling device should satisfy: (7) the 
core and cladding of each waveguide should be lossless, (77) the medium 
external to the waveguides should be lossless, and (777) the two wave- 
guides should have a degenerate mode. (There are also other criteria, 
such as that the waveguide walls be free from imperfections, but they 
are not discussed.) 

The first criterion is an important one and certainly merits further 
study. In this paper, however, we avoid the issue by assuming that the 
device we fabricate has lossless waveguides. A subtler way to put this 
is to say that the device is short enough that losses can be ignored. 

The second criterion is satisfied by assuming that the claddings of 
the two waveguides are contiguous and that there is no medium ex- 
ternal to them. Instead of thinking in terms of two optical fibers, we 
consider two dielectric slab waveguides placed next to each other. 
Fabrication would be similar to that currently used in the production 
of double heterostructure lasers and modulators.*> Each waveguide 
will consist of a slab of high refractive index surrounded by two slabs 


718 THE BELL SYSTEM TECHNICAL JOURNAL, APRIL 1974 


of lower index. Since the double waveguide device will have a central 
slab common to both waveguides, the device can be modelled by a 
5-dielectric-slab model. 

The third criterion, that the two waveguides have a degenerate mode, 
motivates our present study. In practice, it is very difficult to fabricate 
a device with degenerate modes. It is therefore quite important to 
know how well the device will operate if the propagation constants for 
the modes are slightly mismatched. We study the effect of mismatching 
on the beat length and on the capability of the device to transfer power. 
We also discuss methods of tuning the device after it is fabricated. The 
tuning could be used to match the propagation constants more closely. 
It might also be used dynamically, thus offering the possibility of 
utilizing the double waveguide system as a light switch or a modulator. 


li. FORMULATION 


We adopt the standard slab model of an absorptionless dielectric 
medium. The optical dielectric K (x), i.e., the square of the refractive 
index, is assumed to vary only with x and to take the piecewise con- 
stant form shown in Fig. 1. If the waves are assumed to travel in the 





x 


Fig. 1—The optical dielectric profile K (x) for the five-slab model. 
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z-direction with propagation constant 8, then the electric and magnetic 
fields are independent of y and can be expressed as 

E = e(x) exp 72z(wt — 82), 

H = h(x) exp i(wt — £2), 
where w is the angular frequency of the light and ¢ is the time. Both 


TE and TM modes exist. It follows from Maxwell’s equations that the 
electric field e,(x) of a TE mode is described by 


de, 





dx? ot Lk2K (x) ~~ Ble, = 0, (1) 
and that the magnetic field h,(x) of a TM mode is described by 
afi thy 2 y= 
K@) & (gay Gt) + ERG] + Oh, = 0. (2) 
It is required that 
de, 
Cy, dx ? 
1 dhy 
” K(x) dx 


be continuous. Since K(x) is piecewise constant, solution of eqs. (1) 
and (2) subject to the above conditions is straightforward. The 
solution of (1) is 


éy(x) = A exp pix xr<0 
= A[(pi/pz2) sin pox + cos por | 0< 2 < 2u, 
= AC[1 + (p1/p2)T2][—X sinh ps(a@ — 2we) 
+ cosh p3(x — 2we) | Qwe < x < 2(we + ws) 
= AC2C3[1 + (pi/p2)T2 JL. — XT] 
X [(p3/p4) Y sin pa(a — 2we — 2ws) + cos pala — 2we — 2ws) J 
2(we + ws) < x < 2(we + ws + wa) 
= ACCCaL1 + (pi/p2)T2 JTL — XT3] 
x [1 + (ps/pa) YT 4] exp ps(2we + 2ws + 2ws — 2) 
2(we + ws + ws) < x, (3) 


with 
pi) = (@- KK) 4 = 1,8,5, 4 
pi(8) = (kK; — 6?)} a= 2, 4, 
C'2(8) = cos 2p2we, T2(8) = tan 2po2We, 
C3(8) = cosh 2p3ws, T3(8) = tanh 2p3ws, (5) 
C4(B) = cos 2paw,, T4(8) = tan 2pswa, 
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—~ 1 pels —pi_ 
MEPS P3 | 1 + (p1/pe)T2 | (6) 


~ i pal's — ps . 
6) = Ps 1 + (ps/pa) 7. | 7) 


The amplitude A is arbitrary. Equation (8) satisfies the continuity 
condition on e,(x) everywhere, and that on de,/dz at all but the point 
x = 2(we + ws). The continuity condition at this point leads to the 
eigenvalue equation 


X (8) + Y(8) 


T'3(8) = i+ XY)’ 


(8) 


which determines the values of the propagation constant 6 for which 
discrete modes can exist. 

For TM modes, the analogous equations are formed by replacing 
pi by p; and w; by @;, where 


K.p(f) = (@—hK)' 7 =1,3,5, 
K;p.(8) _ (PK; _ B?)} a 2, a, 
DW; = Kyw; 1 2, 3, 4, 


and 8 denotes the propagation constant for a TM mode. For simplicity 
of exposition, we have mainly confined our analysis to that of TE 
modes. It should be clear how to do the corresponding analysis for 
TM modes. 

We remark that the above analysis is quite general and makes no 
assumptions about the relative heights or widths involved in the 
dielectric profile K(x) sketched in Fig. 1. By making appropriate 
choices of the parameters, we could deduce from eqs. (8) to (8) the 
corresponding equations for a single asymmetric or symmetric wave- 
guide, for example. Two cases which interest us particularly are: (2) 
Ks = Ky = Ks with Ko > Ky and Ke > Ks, and (12) Ky = Ko = K3 
with K, > K; and K, > K;. Each of these models an isolated wave- 
guide. We call the first of these (with high dielectric region Ke) guide 
II and the other (with high dielectric region K,4) guide IV. The eigen- 
value equation (8) then reduces to 


X(@) =1 guide II, 
Y(g) = 1 guide IV. 


These may appear more familiar to the reader when cast in the standard 
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form for a single asymmetric guide® 


(pi/p2) + (ps/pe) 
1 — (p1/p2) (ps/p2) 


(p3/ps) + (ps/ pa) 
1 — (p3/ps) (ps/pa4) 


For the model of two adjacent waveguides, we place guides II 
and IV adjacent to each other as illustrated in Fig. 1, with 
Ky > max {K,, K3}, Ks > max {K3, Ks}, and w; > 0 (¢ = 1, 2, 3, 4). 

It is also a relatively straightforward procedure to write down 
precisely how many modes can exist with a given dielectric profile 
K(«).7 Although we omit such expressions here, we do comment that, 
just as a single asymmetric guide may not be able to support a propa- 
gating mode, so also an asymmetric double waveguide structure is 
not always capable of mode propagation. If K, = K3; = Ks, though, 
so that the structure is composed of two parallel symmetric (but not 
necessarily identical) waveguides, then there is always at least one 
mode. 


tan 2wep. = guide II, (9) 


II 


tan 2wsp4 guide IV. (10) 


lil. NEARLY DEGENERATE MODES 


Let 62 and 8, denote solutions of X(8) = 1 and Y(@) = 1, respec- 
tively ; 82 and @,, then, are propagation constants for modes in guides 
II and IV if the guides were isolated from each other. We need make 
no assumption about the order of each mode. In practice, though, both 
propagation constants are likely to be associated with zeroth order 
modes. For definiteness in notation, we assume that 8, 2 82 and write* 


Bs — Bo = 2A. 


We now assume that A is ‘‘small,”’ 1.e., that the two modes are nearly 
degenerate. This assumption, which is fundamental to the remainder 
of the analysis, is stated more explicitly later [eq. (18) ]. Frequently, 
it does not matter (to the order of approximation used) whether 6, or 
84 is used in the evaluation of an expression. In such instances, it 
sometimes is helpful to use the notation Bo(= B2 = 4). 

Our task is now to determine values of @ which satisfy (8). Our 
experience with the degenerate case (A = 0) leads us to expect that 


“In the numerical example of Section VI, we relax this notation to read 
|Bs — Ba| = 2A, where it is not known a priori whether 82 or f, is larger. This 
should not be confusing when taken in context. 
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there will be two solutions 6; and 8_ close to Bo. A study of the coupled 
line equations® for the two modes would demonstrate that 6, = Bs + 6 
and @_ = B, — 5, where 6 is expressed in terms of the (unknown) cou- 
pling coefficients.* We prefer to attack (8) directly ; we shall verify the 
expressions for 6, and 6_, prove that 6 > 0, and give an explicit formula 
for 6. 

We first show that, if 82 and 6,4 are close enough together, then (8) 
has no solution 8 such that 6. S$ B S Bs. Since X(@.) = 1 and 
X’(8) <0 for all 8, we know that if (8—6.)/@.<1, then 0<X(@) <1. 
Similarly, Y(8) = 1. Thus, 


X (6) + ¥(@) _1+X@Y7"@ ., 
1+ X(8)¥(6) X(8) + Y7(6) ~~ 


But 73(8) = tanh 2w3p3 < 1 for all 8, so (8) is not satisfied. 

Next, suppose (8) has a solution 6, = 64 + 6, with 6 > 0. Then if 
5 is small, we know that 0 < X(@,) < 1 and 0 < Y(8,) <1, so (8) 
may be written as 


T (8) = tanh w3p3 = tanh [$(tanh— X + tanh Y) ] 
Oe Ce ia ne Cee ol ey 
X¥FO+a-x) 1+ 0—-P 
We have 
X (64) = X(B2 + 2A + 6) + 1+ (24 + 6)X"(Bo), 


Y (64) = Y(Bs + 6) = 1 + 6Y’(G). (12) 


If we substitute (12) in (11) and perform a perturbation analysis 
under the two assumptions, 


[5(2A + 5)X’ (Bo) ¥’ (Bo) ]} «1, (13) 
[— (2A + 6)X’ (Bo)? + [—8Y’ (Bo)! <1, (14) 

we find 

T (Bo) = 1 — [—(2A + 8) X" (Bo) J1L— 6 Y’ (Bo) }, 

so that 

5 = —A + (A? + 8}, (15) 
where 
ay SD Bo). 

5 = TRTGDY Bol ) 
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On the other hand, if we suppose that (8) has a solution B_ = B. — 6 
with 6 > 0, then X(8_) > 1, Y(6_) > 1, and (8) becomes 


T (8) 


d 


tanh w3p3 = tanh [3(tanh—! X— + tanh7! Y-) ] 


X+(X2-DI+ V4 (¥2-D) 
1+ oxt+ Drs 4 


I 


By a procedure very much like that used to determine 6 = 6, — Bu, 
we find that 6 = 6, as anticipated. Here, the roles of X’ and Y’ must 
be interchanged in (14). 

The effect, then, of placing guides II and IV next to each other is to 
shift their (isolated) propagation constants 62 and 6, symmetrically 
outward by 6 to B_ and 8. The physical meaning of 6 in (16) is clear: 
it is the magnitude of the shift which would occur if guides II and IV 
had degenerate modes (A = 0). We shall call 6 the “‘degenerate shift.”’ 

Let us consider assumptions (13) and (14) in more detail. By means 
of (15) and (16), (13) becomes 


1 — T(Bo) = 1 — tanh wsp; < 1. (17) 


This, then, is essentially a restriction on the separation between the 
two waveguides. If they are too close, our approximations will break 
down. Assumption (14) and its counterpart with the roles X’ and Y’ 
reversed are, by (15), satisfied if 


[A + (A? + 8)? PEL —X’ (Bo) FF + L-Y'@Bo)F} KI. (18) 


This tells us how large A can get without invalidating the approxi- 
mations. 

By using eqs. (4), (6), and (7) cleverly, we see that the expressions 
for X’(8o) and Y’(@o) reduce to the simple forms 


— Bo( ps + p3) 


X" (Bo) = Dieins [2piwe + 1 + (pi/ps) J, (19) 
- 2 2 
Y" (Ba) = PERE PY Popes +1 + (ps/ps). 20) 


Thus, by (16), 
a_i  g, W0) 
Bol ——(p3-+-p2) (i+p0| 2pwos+1+22]| 2psver1 +2 |} 
Pips D3 Ps 
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If we take the case of two identical symmetric waveguides 
(pi = P3 = Ps, P2 = Ps, We = Was) and use the approximation 


tanh wsp1 = 1 — 2 exp (—2wsp,), 


then (21) reduces to 
_ _ Pipz exp (—2wsp1) 22) 
Bo(pi + p2)(1 + piwe) 


which is in agreement with results of Marcuse.? For TM modes, we 
arrive at 


~ ~ ~ 


By = Bs t+ 4, B= Br — = By — Bs, 
where 
PS hae eye 
g rs 1—- tanh Wsp3 
LX" (Bo) Y" (Bo) }’ 
2°(B) = Fapiekeag ae | QIK Io. 
Kipi + we) Pr (23 + K3p3 )| 
i( B+e ) tS b+ B ; 
Roa oe! _ BR 2 a 
P'(Bs) = jrariats cee” | AKIK Bott 
Kipi + Kip? Ds ( K3p3 + Kipi 
= s( B+ )'RB\ BLE 


IV. A LOOK AT THE MODES 


We now discuss what the modes es.(2) associated with 6, are like in 
guides II and IV. The expressions for e;(x) are given by (3). Both the 
shapes of the modes and their relative amplitudes will be of interest. 

We see from (8) that the shape of e,(x) in guide II is given by 


f (Bz, ) = C(pi/pe2) sin pox + cos pot] a, (23) 
Since 
1(B1, 2) = (By 2) + (2A + 8) 4 41 


f(6-, 2) = fx 2) - sal 


the shapes of both modes differ just ae from the unperturbed 
shape f(@2, x); furthermore, the shifts for the two modes are unequal 


MODE COUPLING INWAVEGUIDES 725 


and are in opposite directions. The unperturbed shape can be deter- 
mined with the aid of (9). We find 


Pi _ tan | ps —- 5 (tax PL _ tan-1 P8 )| = tan U 
Pe 2 Pe Pe 


for even-numbered modes and 


Pt — — cot U 
P2 


for odd-numbered modes. Thus, 


f(82, z) = tan U sin pox + cos por 


sec U cos (pox — U) 


2 2)\4 
= eT ae | pee are 
2 


ee | 
—>5[ tan? — — tan? — 24 
3( po De ve 
for even-numbered modes and, similarly, 
2 2) 4 
f(B2, %) = AEB sin | pee — Wepre 


— ; (tan BS — tan“ 7 ) | (25) 
for odd-numbered modes. The mode shapes in guide IV can be deter- 
mined in an analogous manner, with perturbations performed about 
G4 instead of Bo. We leave the details to the reader. 

The above results are not surprising. If the double waveguide 
system has a mode with propagation constant 6, or 8— which is close 
to the propagation constants 82 and 8, of modes that can travel in the 
individual isolated waveguides, then we would indeed expect the shape 
of that double waveguide mode to deviate only slightly in each wave- 
guide from the shape of the mode that could propagate in the isolated 
waveguide. 

The amplitudes of e:(x) in guides II and IV prove to be more inter- 
esting. Let the arbitrary amplitude A in (8) be written as Ax for 
e,(x), and let B# and BY denote the amplitudes of the modes in guides 
IT and IV, respectively. Then (3) and (24) or (25) show that the mode 
amplitudes in guide II are given (to our order of approximation) by 


2 2\3 
BY a2 Ay (pi + p32) : (26) 
2 
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In guide IV, the amplitudes are, by (3), 


2 2) 
BY = A, | C20: (2 a = T) (1 — XT) ac Th) (27) 
2 4 





Bs, 


Care must be taken in the evaluation of C3(1 — X73) at By. Since 
X (B+) = 1, Y (64) < 1, and by (8), 


2 ee = 1 
Ts = fy py = tanh (tant X + tanh“ ¥), 





we have 
= se ee _ 1+ XY 
C3 = cosh (tanh 1X + tanh Y) = Gd — x1 — YA! ; 


so that 





=) 


6(t= 2TH) |p, 2 G =) |, * [ eae] 


5Y’ 





We find in a similar manner that 


5X’ } 
C31 — xT = — | ——.— |- 
Thus the mode amplitudes in guide IV are 
(03 + pi)} | = x) 
Pp 5Y’ 
(p3 + pi)? 6X’ 
ps (2A + 6)Y’ 


BY = AC (1 + -s 7’) 
2 


BY = — 4.¢,(1+ 214) |: (28) 
Pe 

We observe that the mode amplitudes will have the same signs in one 
waveguide and the opposite signs in the other. Thus, e;(x) might be 
termed quasi-even and e_(x) quasi-odd. More startling, however, is 
the realization that, if A > 0, then the ratio |BY/BU| = A,/A_ may 
be quite different from the ratio | BYY/B'Y| = (A,/A_)[(24 + 6)/6]. 
, As we see in the next section, this will have serious implications when — 
we consider the double waveguide system as a device for transferring 
power. 

For future reference, we write e,(z) for a system consisting of two 
symmetric, but not necessarily identical, waveguides (K, = K3; = Ks). 
In this instance, (9) and (10) imply that 


Coll + (pi/p2)T2] = 1, 
Cull + (pi/ps)T 4] = 1, 
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so that we have by (8) and our previous analysis 
é4(z) = Ay exp pix zx<0 


2 2} 
= Ay PEE PD cos pale — wn) 0O<2< 2ue2 
2 


= A,[—X sinh pi(z — 2w2) + cosh pi(x — 2w2)]| 1, 
Qw. << x < 2(we + ws) 


S “7k 2 2\4 

-4,[ 25°92] MS DON a cease C2 ap nays 
6 Y Pp 

2(we + w3) < & < 2(we + wz + wa) 


=| Oe 


3 
: v7 | exp p1(2we + 2w3 + 2ws — x) 


2(we + w3 + wa) <2, (29) 
e.(z) = A_exp pit x <0 


2 2)4 
= APP cos pa(o — ws) 0< 2 < Qu, 
2 


A_[—X sinh pi(a — 2we) + cosh pi(x — 2we)]| 6. 
2we <a < 2(we + ws) 
sai. bX’ | (pi + pa)} 
(2A + 6)¥’ ps 
2(we + ws) < 4% < 2(wWe + ws + wa) 


cos p4(x — 2we — 2w3 — wa) 


I 


5X’ 4 
— AL Oa +)Y exp pi(2we + 2ws + 2w, — 2) 
2(we + ws + ws) < x. (30) 


V. BEAT LENGTH AND POWER TRANSFER 


Suppose the two modes Hy = es(x) exp 7(wt — 642) travel down the 
double waveguide device. Since they travel at different phase velocities, 
the quasi-even and quasi-odd modes will alternately add and (par- 
tially) cancel in each waveguide. Hence, power is transferred between 
the two waveguides. 

The beat length Z over which this transfer takes place is given by 

L => —— “s => ples Ere a. ° 31 

2(6+ A)  28L1 + (4/5)? ]} a 

Note that, if the degenerate shift 6 is fixed, then as the mismatching A 
increases, the beat length Z decreases. We can conceive of ways to 
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tune the double waveguide device and thus to change the beat length. 
This might be useful for light switching or modulation. 

It is important to learn just how much power can be transferred 
in the waveguide system. Suppose, for definiteness, that we excite just 
one waveguide at z = 0 (say, guide IV), with the intent of transferring 
power to guide II via the mode-coupling mechanism. If guides II and 
IV have degenerate modes (A = 0), then as the modes travel down 
the waveguide system, they will alternately add and then cancel (to 
order 6?) in each waveguide, with addition occurring in one wave- 
guide when at the same position cancellation occurs in the other. If 
guides II and IV have nondegenerate modes (A > 0), however, then 
complete cancellation cannot take place in both waveguides: by (27) 
and (28), we see that if the amplitudes of e; (x) and e_(x) are adjusted 
so that the modes cancel at z = 0 in guide II, then they will never 
cancel fully in guide IV. 

From a practical point of view, a parameter which is likely to be of 
interest in this matter is the fraction of the total power introduced into 
the system which can be transferred into guide II. If the modes are 
poorly confined, an appreciable fraction of the power carried by a 
waveguide may actually be outside the high dielectric guiding region. 
If the reader is interested in the fraction of the power which can be 
transferred not only to the guiding region of guide II, but also to its 
vicinity, we would need a power transfer ratio G to be defined by 


G = a : [ex (x) + e_(x) Pdx if [ [e2. (x) + e2 (x) Jdz, 


where a is some number between 2w: and 2(w2 -+ ws) which defines 
the “boundary” between guides II and IV. The numerator of this 
expression, then, is proportional to the power carried by the entire 
guide II. 

Unfortunately, for a general asymmetric waveguide system, it is 
not at all clear how to define the position of the “boundary” between 
the two waveguides. If the system consists of two symmetric wave- 
guides which are nearly identical (except for a small deviation if the 
modes are slightly mismatched), then it seems clear that the boundary 
should be midway between the two dielectric regions, i.e., at 
a = 2we + w3. By using (29) and (30), we find in this instance that 
we have to first order 


@ = [1+ (ap. 


Thus for perfectly matched waveguides (A = 0), the power transfer 


MODE COUPLING IN WAVEGUIDES 729 


is complete, to first order. As the mismatching increases, the power 
transfer ratio decreases rapidly. 

Complete power transfer (to first order) is a direct consequence of 
assumption (17), which implies little overlap between the field as- 
sociated with guide II and that associated with guide IV. A higher 
order perturbation analysis would show that in fact there is some field 
overlap and that, even if A = 0, the power transfer is not complete. 
As the waveguide separation increases, there would be less field overlap 
and the power transfer would be more nearly complete. 

For a general asymmetric waveguide system, we might define the 
“boundary” between the two waveguides to be, say, at the position 
where the “quasi-even”’ field attains its minimum. Such a definition 
can be cumbersome to apply mathematically. In general, though, we 
would expect results similar to those obtained for the symmetric 
system. If the modes are degenerate and one waveguide is excited, then 
virtually all the power can be transferred to the vicinity of the other 
waveguide. The power transfer ratio decreases as the mismatching 
increases. 

It will be instructive to introduce a second power transfer ratio F, 
which can be defined precisely. It will be the fraction of the total power 
introduced into the system which can be transferred into the high 
dielectric region of guide II, the waveguide which was originally un- 
excited. If terms of order A/8o are neglected, this power transfer ratio 
is defined by 


P= [" Ces) + (@) Fae / [(&@) +&@lz, (2) 


where the mode amplitudes A, and A_ are equal. 

If the modes are poorly confined in guide II, the power transfer 
ratio F may be considerably less than unity even if the waveguides 
are perfectly matched (A = 0). The definition of F is concerned only 
with the power which can be transferred into the high dielectric region 
of guide II; hence, F depends upon the confinement factor of the 
waveguide (to be defined below) as well as upon the amount of mis- 
matching A. 

Evaluation of (32) can be very messy for the general case of two 
asymmetric waveguides. We simplify the subsequent analysis and yet 
retain its essential flavor by assuming that the double waveguide 
structure is composed of two symmetric, but not necessarily identical, 
waveguides (K; = Kz; = Ks). The modes e,(x) are then given by (29) 
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and (30). For the numerator of F’, we find 
2we 2 2 2we 

i [e, (2) + e_(x) Pda = 4A, A i COS? po(X& — we)dx 
0 2 0 


2 2 
= 44% eee [powe + 3} sin 2pow» |. 
2 


But since (9) implies that 





2pip2 
2 


sin 2p.w. = 
sig pi + ps 


for a symmetric waveguide, we have 


2w2 2 2 2 
i [er (x) + e_(x) Pdr = “i | pa: ( PL) + Bal. 
0 P2 P2 Pe 


Similar procedures can be used in the evaluation of the denominator 
of F. Special care must be taken in evaluating the integral in the 
interval 2w, < x < 2(we + ws3), where e,(x) and e_(x) have the same 
functional description, but the function is evaluated at 6, and B_, 
respectively. In the resulting analysis, C3; and S; must be evaluated at 
6... Procedures similar to those used following (27) are helpful. The 
power transfer ratio turns out to be 


es 4 (Pit/p2) 
20 (Pir/p2) + (1/pi)] + 20 (Piv/ps) + (1/p1) J. + 24/8) X'/Y"’ 


where 


2 2 
Py = pus (EP ) eee 
P2 Pa 
2 2 
Pry = pavs (PP) tae (33) 


The expression for F can be simplified. By using (19), (20), and 
(33), we find that 


Py 1 xX’ (= 1 ) 
ne ae ey 34 
po i YON pe bar (34) 
so that 
F = Ei as (35) 


[(Pu/p2) + G/pi) JL 4 (4278) ] 


Both (84) and (35) have interesting physical interpretations. In 
order to discuss them, we make a brief digression. Suppose that, 
instead of the double waveguide system, we just have an isolated 
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guide IJ, which for generality we assume is not necessarily symmetric. 
(In the notation of Section II, we have K3; = K, = K;.) If the electric 
field is given by e(x) exp 7(wt — 622), then the power carried by the 
entire waveguide is proportional to 


P= [i etaar 


and the fraction of the total power which is confined to the high 
dielectric region is given by 


C= ee e(ade / ia e? (x) dx. 


It is not difficult to show that 


P= ae ps [2piwe + 1 + (pi/ps) J, 
where e(x) was assumed to have unit amplitude at x = 0. Upon 
comparing this expression with (19), we find that 


baa _ 2B (pa + ps) 
ps (pi + p3) 
so that X’ is related in a simple manner to the power carried by the 
waveguide with which it is associated. Now if the isolated guide II 
happens to be symmetric, then it is also true that 


Pu 
Pe pr’ 

5 pee 

P1 

Since a similar relationship holds for an isolated guide IV, (34) follows. 
The physical implication of (384) is that, if a single waveguide is ex- 
cited in a matched double waveguide system (A = 0), then the power 
is distributed between the two modes in such a manner that 


[4 @ae = [2 war. 


“eo —~o 


P= — 


If an isolated guide II is symmetric, then the confinement factor C 
can be shown to be given by 


P u/ Pe 
“= (Paipsy = lipa 2) 
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Hence, the power transfer ratio F' is simply given by 
F = Cl + (4/6)? }". (37) 


Note that F is completely independent of the parameters of guide IV, 
as well as of w3 because of assumption (17). 

We remark once more that these results are first-order approxima- 
tions. If a higher order perturbation analysis were undertaken, it 
would show that F is also dependent upon the amount of field overlap 
between the two waveguides. 

It is important to note that the amount of mismatching A can have 
a significant effect on the power transfer ratio: as A increases, F de- 
creases rapidly. We might remark that, while the beat length L 
depends upon 6 and the ratio A/é, the power transfer ratio F depends 
only upon A/6. Thus, by proper device design, it may be possible to 
adjust 6 and A/6 to get both appreciable power transfer and a desirable 
beat length. 


VI. A NUMERICAL EXAMPLE 


To illustrate our results, let us give an example, using parameters 
which could be realized in a GaAs—Al,Gai_,As heterostructure. We 
consider two waveguides which are each symmetric, but which have 
different widths and dielectric step heights. Our intent is to excite one 
waveguide and then to transfer power into the other by means of mode 
coupling. The parameters of one waveguide, say guide II, are taken to 
be fixed. The width of the other waveguide is considered a variable. 
For any given width 2w., we adjust the dielectric height K, so that 
the propagation constants 62 and #4 for the zeroth order TE modes of 
the two waveguides match. 

We learn how the degenerate shift 6 varies with the spacing 2w; 
between the two waveguides and with the width 2w, of guide IV. We 
look at the beat length Z and the power transfer ratio F for an idea of 
how much mismatching of the propagation constants 62 and 64 can be 
tolerated. We then discuss the amount of mismatching 2A = |84 — B2| 
which might occur in a practical situation and show how tuning can 
reduce this. 

To be specific, suppose that 


Ke = 11.868, 
K, = 0.9K, = 10.681, 
w. = 0.1 um, 


k = 5.4494 X 104 em. 
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The dielectric constant K,» corresponds to an index of refraction 
Ki = 3.445 of GaAs at a wavelength \ = 27 k- = 1.153 um. The 
halfwidth w, of guide IV will be assumed to vary between 0.1 um and 
0.8 pm. 
For the zeroth order TE mode of a symmetric waveguide, we find 

that (9) reduces to 

tan pow, = p1/P2, 

Pi = (83 a k?K)}, 

Po = (kh? Ky — B3)}. (38) 


The above parameters for guide II give us 


B2 = 1.8049 & 105 cm“, 
pi = 2.9311 X 104 cm“, 
pe = 5.1631 X 10* cm. 


Equation (38) holds for guide IV if each subscript 2 is replaced by a 4. 
For any given value of ws, we adjust K4 so that 6, is the same as fo. 
Some values of K, and p, are given in Table I. 

The parameters for Ki, Ke, w. and any given pair K4, w, then define 
two waveguides which have a degenerate mode at the given wave- 
length \. The degenerate shift 6 is given by (21) for symmetric wave- 
guides: . 

pe. pipsps(1 — tanh wsp1) 
2B2L (pi + p2)(pi + pi) (1 + piw2) (1 + piws) }} 


Figure 2 shows 6 as a function of w3 for various values of w4. We see 
that the coupling decreases rapidly as the waveguide separation in- 
creases or as the width of guide IV increases. The beat length Z, then, 
will increase rapidly with w3 or w4if A is small enough. If, however, the 


Table | — Values of K, (and hence p,) needed to match £, for the 
zeroth order TE mode in guide IV (of halfwidth w4) 
to the 8» for the corresponding mode in guide II 


was (um) K, ps (104 cm7) 
0.1 11.868 5.1631 
0.2 11.381 3.4917 
0.3 11.222 2.7340 
0.4 11.145 2.2763 
0.5 11.100 1.9619 
0.6 11.071 1.7295 
0.7 11.051 1.5494 
0.8 11.037 1.4048 
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Fig. 2—The degenerate shift 5 as a function of half the distance between the two 
waveguides w;3, with the halfwidth w, of guide IV as a parameter. 


waveguides cannot be fabricated to match as closely as desired, the 
story can be different. If we take, quite arbitrarily, A = 100 cm™!, we 
would find for example from (31) and the data in Fig. 2 that if 
Ws, = 0.6 um, 


0.113 0.4 
_ | 0.1387 : _ 10. 7 oy 
L= 0.150 ( ™™ if w;= ogf um (A = 100 em“) 
0.157 0 


while if A = 0, 


0.162 0.4 
L=<,0.279>mm if ws = + 0.5 + um (A = 0). 
0.490 0.6 
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Thus, Z is reduced significantly and changes less rapidly with w; if A 
is large enough. Similar results are found if w; is fixed and wz varies. 

Next, suppose we wish to excite one waveguide and to transfer 
power to the other one. If the propagation constants for the two 
waveguides were perfectly matched (A = 0), then by (35) to (87) 
the power transfer ratio F would be the confinement factor C’ which 
is plotted in Fig. 8. The upper curve is used if guide II is excited and 
power is transferred to guide IV; the lower curve is used if guide IV is 
originally excited and power is transferred to guide II. The power 
transfer ratio is larger if power is transferred from a narrow guide II 
to a wide guide IV than if the power is being transferred the other 
way, since the power is more tightly confined within the guiding region 
in the wider waveguide. 

Lest the reader become confused, we recall that F is defined as the 
fraction of the total power introduced into the system which can be 
transferred into the high dielectric region of the guide which was 
originally unexcited. F does noé concern itself with how much power 
in the high dielectric region of one guide can be transferred into the 
high dielectric region of the other guide. 


1.0 


rT —» WV 
0.8 
0.6 
Cc 
W-I 
0.4 
0.2 
0 
0 0.2 0.4 0.6 0.8 1.0 


Wy IN um 


g. 3—The confinement factor C for degenerate modes. Guide II is fixed, and the 
halfeidth wa of guide IV varies. 
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Again, the mismatching A can have a significant effect. If, for ex- 
ample, we assume that w, = 0.6 wm and transfer power from guide II 
to guide IV, we find from (35) (evaluated with the parameters for 
guide IV) that if A = 100 cm™, 


0.66 0.4 
F = 0.44 if w3 = < 0.5 > um. 
0.22 0.6 


(The maximum value of F’ is 0.91 for A = 0.) 

If we know that the fabrication procedure will likely make A of 
significant size, then the only way (for given waveguide parameters) to 
get good power transfer is to make 6 large enough. A trade-off thus 
must be made between good power transfer and a long beat length. 

Fortunately, tuning can be a viable alternative to making such a 
trade-off for badly matched waveguides. To tune a device which has 
already been fabricated, we would need to alter one or more slab 
widths or dielectric heights. 

Some possible methods of tuning are to change K, or K, by altering 
the free carrier density or using the electro-optic effect, or to change the 
outer slab levels Ki or K; by diffusion or ion implantation. This can 
be achieved, in principle, by growing at least one waveguide with a 
small gradient in the slab width. Phase matching then can be achieved 
by lateral positioning of the light beam, which travels approximately 
perpendicular to the gradient of the slab width. 

We compute a possible value of A for a specific example and then see 
how much tuning is needed to reduce A to zero. Suppose that (for any 
given w,) the double waveguide device is fabricated according to the 
specifications for matched propagation constants, but that there are 
slight errors in we, ws, K, — Ky, and K4 — Ky. Assume the following 
errors, which are probably reasonable if the device is fabricated by 
molecular beam epitaxy:> the ratio w2/w,4 is nearly constant, and wy» 
varies by +0.02 um; the ratio (K, — K,)/(K4 — Ky) is nearly con- 
stant, and K, — K, varies by +0.10. We shall take K2 to be fixed. 
Then the extreme cases would be given by w. = 0.12 (0.08) um, 
Ki, = 10.781 (10.581), and 


Ws = (Ws/W2) Wa, 


ee Oe K,— K, K, — Ky 
Ka = Ki — (Gea) Ki + (Gee ) Ke 


where the primed parameters refer to the values in the fabricated 
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Fig. 4—The mismatching A as a function of the ratio of the guide widths. 


device. We just treat the extreme case for which w, = 0.12 um and 
K, = 10.781, since the other gives changes of comparable size. 

We find from (88) that, with these fabrication errors, 62 = 1.8156 
X 105 cm and 6,4 varies from 1.8156 X 105 em= for ws, = we to 
1.8119 X 10° cm™ for wy =8wy. The resulting values of A=4|84—£2| are 
plotted in Fig. 4. Although both 2 and #8, have changed significantly 
from the value 1.8049 X 105 cm7 for which the device was designed, 
they both change in the same direction, so A reflects a less radical 
change. 

In tuning the system, suppose we consider guide IV, and hence @,, 
as fixed for any given w.; we shall alter either the dielectric height or 
the symmetry of guide II to adjust .. 

If we lower K, to make 62 match 84, we find by (88) that the altered 
values Ky are those given in Table II. The change in K, is thus less 
than 0.4 percent. Such a change is feasible with free carrier injection 


Table Il — Values of K. needed to tune the mismatched 
waveguide system 
w/w? K4 % change 

1 11.868 0 

2 11.846 0.19 
3 11.840 0.24 
4 11.831 0.31 
5 11.829 0.33 
6 11.825 0.36 
7 11.824 0.37 
8 11.823 0.38 
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Table Iil— Values of K,; needed to tune the mismatched 
waveguide system 


w',/Ws Ki % change 


10.781 
10.716 
10.697 
10.670 
10.663 
10.657 
10.647 
10.643 


ONO wWNe 
ete ey Se 
wri Rone 


and is about an order of magnitude larger than can be handled com- 
pletely by the electro-optic effect. 

Another possible method of tuning would be to make guide II 
asymmetric. If we keep K2 = 11.868, K3 = 10.781, w2 = 0.12 um, and 
alter K,; to match 82 to B14, we use (9) in the form 


a po tan 2wop. — p3 
‘1+ (ps/p2) tan 2wyp. 


to get p, and then use (4) to find K;. The altered values K, are given in 
Table III. The change is thus no more than 1.3 percent. This can be 
handled by diffusion or ion implantation. 
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